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NUCLEIC ACID MOLECULES ENCODING TRANSMEMBRANE SERINE 
PROTEASES, THE ENCODED PROTEINS AND METHODS BASED 

THEREON 

RELATED APPLICATIONS 

5 For U.S. purposes, benefit of priority under 35 U.S.C. §1 19(e) to 

U.S. provisional application Serial No. 60/179,982, to Edwin L. Madison 
and Edgar O. Ong, filed February 3, 2000, entitled "NUCLEOTIDE AND 
PROTEIN SEQUENCES OF A TRANSMEMBRANE SERINE PROTEASE AND 
METHODS BASED THEREOF"; to U.S. provisional application Serial No. 

10 60/183,542, to Edwin L. Madison and Edgar O. Ong, filed February 18, 
2000, entitled "NUCLEOTIDE AND PROTEIN SEQUENCES OF A TRANS- 
MEMBRANE SERINE PROTEASE AND METHODS BASED THEREOF"; to 
U.S. provisional application Serial No. 60/213,124, to Edwin L. Madison 
and Edgar O. Ong, filed June 22, 2000, entitled "NUCLEOTIDE AND 

15 PROTEIN SEQUENCES OF A TRANSMEMBRANE SERINE PROTEASE AND 
METHODS BASED THEREOF"; to U.S. provisional application Serial No. 
60/220,970, to Edwin L. Madison and Edgar O. Ong, filed July 26, 
2000, entitled "NUCLEOTIDE AND PROTEIN SEQUENCES OF A TRANS- 
MEMBRANE SERINE PROTEASE AND METHODS BASED THEREOF"; and 

20 to U.S. provisional application Serial No. 06/234,840 to Edwin L. 

Madison, Edgar O. Ong and Jiunn-Chern Yeh, filed September 22, 2000, 
entitled "NUCLEIC ACID MOLECULES ENCODING TRANSMEMBRANE 
SERINE PROTEASES, THE ENCODED PROTEINS AND METHODS BASED 
THEREON" is claimed herein. Benefit of priority under 35 U.S.C. §120 to 

25 U.S. application Serial No. 09/657,968, to Edwin L. Madison, Joseph 
Edward Semple, Gary Samuel Coombs, John Eugene Reiner, Edgar O. 
Ong, Gian Luca Araldi, filed September 8, 2000, entitled "INHIBITORS OF 
SERINE PROTEASE ACTIVITY OF MATRIPTASE OR MTSP1 " is also 
claimed herein. This application is a continuation-in-part of U.S. 

30 application Serial No. 09/657,986. For international purposes, benefit of 
priority to each of the above-noted applications is claimed herein. 
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This application is related to U.S. provisional application Serial No. 
60/166,391 to Edwin L. Madison and Edgar O. Ong, filed November 18, 
1999 entitled "NUCLEOTIDE AND PROTEIN SEQUENCES OF PROTEASE 
DOMAINS OF ENDOTHELIASE AND METHODS BASED THEREON". This 
a application is also related to International PCT application No. 
PCT/USOO/31803, filed November 17, 2000. 

The above-noted provisional applications, patent application and 
International PCT application are incorporated by reference in their 
entirety. All patents, applications, published applications and other 
publications and sequences from GenBank and other data bases referred 
to herein are incorporated by reference in their entirety. 
FIELD OF INVENTION 

Nucleic acid molecules that encode proteases and portions thereof, 
particularly protease domains are provided. Also provided are prognostic, 
diagnostic and therapeutic methods using the proteases and domains 
thereof and the encoding nucleic acid molecules. 
BACKGROUND OF THE INVENTION AND OBJECTS THEREOF 

Cancer a leading cause of death in the United States, developing in 
one in three Americans; one of every four Americans dies of cancer. 
Cancer is characterized by an increase in the number of abnormal 
neoplastic cells, which proliferate to form a tumor mass, the invasion of 
adjacent tissues by these neoplastic tumor cells, and the generation of 
malignant cells that metastasize via the blood or lymphatic system to 
regional lymph nodes and to distant sites. 

Among the hallmarks of cancer is a breakdown in the communi- 
cation among tumor cells and their environment. Normal cells do not 
divide in the absence of stimulatory signals, cease dividing in the 
presence of inhibitory signals. Growth-stimulatory and growth-inhibitory 
signals, are routinely exchanged between cells within a tissue. In a 
cancerous, or neoplastic, state, a cell acquires the ability to "override" 
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these signals and to proliferate under conditions in which normal cells do 
not grow. 

In order to proliferate tunnor cells acquire a nunnber of distinct 
aberrant traits reflecting genetic alterations. The genomes of certain well- 
5 studied tumors carry several different independently altered genes, 

including activated oncogenes and inactivated tumor suppressor genes. 
Each of these genetic changes appears to be responsible for imparting 
some of the traits that, in the aggregate, represent the full neoplastic 
phenotype. 

10 A variety of biochemical factors have been associated with 

different phases of metastasis. Cell surface receptors for collagen, 
glycoproteins such as laminin, and proteoglycans, facilitate tumor cell 
attachment, an important step in invasion and metastases. Attachment 
triggers the release of degradative enzymes which facilitate the 

15 penetration of tumor cells through tissue barriers. Once the tumor cells 
have entered the target tissue, specific growth factors are required for 
further proliferation. Tumor invasion (or progression) involves a complex 
series of events, in which tumor cells detach from the primary tumor, 
break down the normal tissue surrounding it, and migrate into a blood or 

20 lymphatic vessel to be carried to a distant site. The breaking down of 
normal tissue barriers is accomplished by the elaboration of specific 
enzymes that degrade the proteins of the extracellular matrix that make 
up basement membranes and stromal components of tissues. 

A class of extracellular matrix degrading enzymes have been 

25 implicated in tumor invasion. Among these are the matrix 

metalloproteinases (MMP), For example, the production of the matrix 
metalloproteinase stromelysin is associated with malignant tumors with 
metastatic potential (see, e.g., McDonnell et al. (1990) Smnrs. in Cancer 
Biology 7:107-115; McDonnell et al. (1990) Cancer and Metastasis 

30 Reviews 5:309-319). 
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The capacity of cancer cells to metastasize and invade tissue is 
facilitated by degradation of the basement membrane. Several proteinase 
enzymes, including the MMPs, have been reported to facilitate the 
process of invasion of tumor cells. MMPs are reported to enhance 
degradation of the basement membrane, which thereby permits tumorous 
cells to invade tissues. For example, two major metalloproteinases 
having molecular weights of about 70 kDa and 92 kDa appear to enhance 
ability of tumor cells to metastasize. 

Type li Transmembrane Serine Proteases (TTSPs) 

In addition to the MMPs, serine proteases have been implicated in 
neoplastic disease progression. Most serine proteases, which are either 
secreted enzymes or are sequestered in cytoplasmic storage organelles, 
have roles in blood coagulation, wound healing, digestion, immune 
responses and tumor invasion and metastasis. A class cell surface 
proteins designated type II transmembrane serine proteases, which are 
membrane-anchored proteins with N-terminal extracellular domains, has 
been identified. As cell surface proteins, they are positioned to play a 
role in intracellular signal transduction and in mediating cell surface 
proteolytic events. 

Cell surface proteolysis is a mechanism for the generation of 
biologically active proteins that mediate a variety of cellular functions. 
These membrane-anchored proteins, include a disintegrin-like and 
metalloproteinase (ADAM) and membrane-type matrix metalloproteinase 
(MT-MMP). In mammals, at least 17 members of the family are known, 
including seven in humans (see. Hooper et al. (2001) J. Biol. Chem. 
276:857-860). These include: corin (accession nos. AFl 33845 and 
AB013874; see, Yan et aL (1999) J. Biol. Chem. 274:14926-14938; 
Tomia et al. (1998) J. Biochem. 724:784-789; Uan et al. (2000) Proc. 
NatL Acad. Sci. U.S.A. S7:8525-8529); enterpeptidase (also designated 
enterokinase; accession no. U09860 for the human protein; see, Kitamoto 
et aL (1995) Biochem. 27\ 4562-4568; Yahagi et al. (1996) Biochem. 
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Biophys. Res. Commun. 2 75:806-812; Kitamoto et al. (1994) Proc. Natl. 
Acad. Sci. U.S.A. 57:7588-7592; Matsushima et al. (1994) J. Biol. 
Chem. 255:19976-19982;); human airway trypsin-like protease (HAT; 
accession no. AB002134; see Yamaoka et al. J. Biol. Chem. 275:1 1894- 
5 11901); MTSP1 and matriptase (also called TADG-15; see SEQ ID Nos. 1 
and 2; accession nos. API 33086/AF1 1 8224, AF04280022; Takeuchi et 
al. (1999) Proc. Natl. Acad. Sci. U.S.A. 56:11054-1161; Lin et al. (1999) 
J. Biol. Chem. 274/18231-18236; Takeuchi et al. (2000) J. Biol. Chem. 
27 5:26333-2^3 A2; and Kim et al. (1999) Immunogenetics 45:420-429); 

10 hepsin (see, accession nos. Ml 8930, AF030065, X70900; Leytus et al. 
(1988) Biochem. 27: 11895-11901; Vu et al. (1997) J. Biol. Chem. 
272:31315-31320; and Farley et al. (1993) Biochem. Biophys. Acta 
7 773:350-352; and see, U.S. Patent No. 5,972,616); TMPRS2 (see. 
Accession Nos. U75329 and AP1 13596; Paoloni-Giacobino et al. (1997) 

15 Genomics 44:309-320; and Jacquinet et al. (2000) FEBS Lett. 468: 93- 
100); and TMPRSS4 (see. Accession No. NM 016425; Wallrapp et al. 
(2000) Ca/7ce/- 60:2602-2606). 

Serine proteases, including transmembrane serine proteases, have 
been implicated in processes involved in neoplastic development and 

20 progression. While the precise role of these proteases has not been 
elaborated, serine proteases and inhibitors thereof are involved in the 
control of many intra- and extracellular physiological processes, including 
degradative actions in cancer cell invasion, metastatic spread, and 
neovascularization of tumors, that are involved in tumor progression. It is 

25 believed that proteases are involved in the degradation of extracellular 
matrix (ECM) and contribute to tissue remodeling, and are necessary for 
cancer invasion and metastasis. The activity and/or expression of some 
proteases have been shown to correlate with tumor progression and 
development. 
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For example, a membrane-type serine protease MTSP1 (also called 
matriptase; see SEQ ID Nos. 1 and 2 from U.S. Patent No. 5,972,616; 
and GenBank Accession No. AFl 18224; (1999) J. BioL Chem. 
274:18231-18236; U.S. Patent No. 5,792,616; see, also Takeuchi 
(1999) Proc, Natl. Acad. Sci. U.S.A. 55:1 1054-1 161) that is expressed in 
epithelial cancer and normal tissue (Takeucuhi et al. (1999) Proc. Natl. 
Acad. Sci. USA, 96(20) : 1 1054-61) has been identified. Matriptase was 
originally identified in human breast cancer ceils as a major gelatinase 
(see, U.S. Patent No. 5,482,848), a type of matrix metalloprotease 
(MMP). It has been proposed that it plays a role in the metastasis of 
breast cancer. Its primary cleavage specificity is Arg-Lys residues. 
Matriptase also is expressed in a variety of epithelial tissues with high 
levels of activity and/or expression in the human gastrointestinal tract and 
the prostate. 

Prostate-specific antigen (PSA), a kallikrein-like serine protease, 
degrades extracellular matrix glycoproteins fibronectin and laminin, and, 
has been postulated to facilitate invasion by prostate cancer cells 
(Webber a/. (1995) Clin. Cancer Res.. 1 (1 0) :1 089-94). Blocking PSA 
proteolytic activity with PSA-specific monoclonal antibodies results in a 
dose-dependent decrease in vitro in the invasion of the reconstituted 
basement membrane Matrigel by LNCaP human prostate carcinoma cells 
which secrete high levels of PSA. 

Hepsin, a cell surface serine protease identified in hepatoma cells, 
is overexpressed in ovarian cancer (Tanimoto et al. (1997) Cancer Res. , 
57(14) :2884-7)- The hepsin transcript appears to be abundant in 
carcinoma tissue and is almost never expressed in normal adult tissue, 
including normal ovary. It has been suggested that hepsin is frequently 
overexpressed in ovarian tumors and therefore may be a candidate 
protease in the invasive process and growth capacity of ovarian tumor 
cells. 
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A serine protease-like gene, designated normal epithelial 
cell-specific 1 (NES1) (Liu et a!.. Cancer Res., 56(14) :3371-9 (1996)) has 
been identified. Although expression of the NESl mRNA is observed in 
all normal and immortalized nontumorigenic epithelial cell lines, the 
5 majority of human breast cancer cell lines show a drastic reduction or a 
complete lack of its expression. The structural similarity of NESl to 
polypeptides known to regulate growth factor activity and a negative 
correlation of NESl expression with breast oncogenesis suggest a direct 
or indirect role for this protease-like gene product in the suppression of 
10 tumorigenesis. 

Hence transmembrane serine proteases appear to be involved in the 
y etiology and pathogenesis of tumors. There is a need to further elucidate 

"^^J their role in these processes and to identify additional transmembrane 

fi% proteases. Therefore, it is an object herein to provide transmembrane 

15 serine protease (MTSP) proteins and nucleic acids encoding such MTSP 
proteases that are involved in the regulation of or participate in 
tumorigenesis and/or carcinogenesis. It is also an object herein to provide 
IJfl prognostic, diagnostic, therapeutic screening methods using the such 

rlj proteases and the nucleic acids encoding such proteases. 

20 SUMMARY OF THE INVENTION 

Provided herein are isolated protease domains of the Transmem- 
brane Serine Protease family, particularly the Type II Transmembrane 
Serine Protease (TTSP) family (also referred to herein as MTSPs), and 
more particularly TTSP family members whose functional activity differs 
25 in tumor cells from non-tumor cells in the same tissue. For example, the 
MTSPs include those that are activated and/or expressed in tumor cells at 
different levels, typically higher, from non-tumor cells; and those from 
cells in which substrates therefor differ in tumor cells from non-tumor 
cells or otherwise alter the specificity of the MTSP, 
30 The MTSP family as intended herein does not include any 

membrane anchored or spanning proteases that are expressed on 
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endothelial cells. Included among the MTSPs are several heretofore 
unidentified MTSP family members, designated herein as MTSPS and 
MTSP4 and a new form of a protein designated herein as MTSP6. In 
addition to the protease domains of each of MTSPS and MTSP4, the full- 
5 length proteins, including those that results from splice variants, 
zymogens and activated forms, and uses thereof, are also provided. 

The protease domains as provided herein are single-chain poly- 
peptides, with an N-terminus (such as IV, VV, IL and II) generated at the 
cleavage site (generally having the consensus sequence RiVVGG, 
10 RilVGG, RilVNQ, R4ILGG, R4VGLL, R4ILGG or a variation thereof; an 
N-terminus RiV or Ril, where the arrow represents the cleavage point) 
when the zymogen is activated. To identify the protease domain an Ri 
should be identified, and then the following amino acids compared to the 
above noted motif, 

15 The protease domains generated herein, however, do not result 

from activation, which produces a two chain activated product, but rather 
are single chain polypeptides with the N-terminus Include the consensus 
sequence 1 VVGG, IIVGG, iVGLL, IlLGG or 4IVNG or other such motif 
at the N-terminus. As shown herein, such polypeptides, although not the 

20 result of activation and not double-chain forms, exhibit proteolytic 
(catalytic) activity. These protease domain polypeptides are used in 
assays to screen for agents that modulate the activity of the MTSP. Such 
assays are also provided herein. In exemplary assays, the affects of test 
compounds in the ability of a protease domains to proteolytically cleave a 

25 known substrate, typically a fluorescently, chromogenically or otherwise 
detectably labeled substrate, are assessed. Agents, generally 
compounds, particularly small molecules, that modulate the activity of the 
protease domain are candidate compounds for modulating the activity of 
the MTSP. The protease domains can also be used to produce single- 

30 chain protease-specific antibodies. The protease domains provided 

herein include, but are not limited to, the single chain region having an N- 
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terminus at the cleavage site for activation of the zymogen, through the 
C-terminus, or C-terminal truncated portions thereof that exhibit 
proteolytic activity as a single-chain polypeptide in in vitro proteolysis 
assays, of any MTSP family member, preferably from a mammal, inclu- 
5 ding and most preferably human, that, for example, is expressed in tumor 
cells at different levels from non-tumor cells, and that is not expressed on 
an endothelial cell. These include, but are not limited to : MTSP1 (or 
matriptase), MTSP3, MTSP4 and MTSP6. Other MTSP protease domains 
of interest herein, particularly for use in in vitro drug screening proteolytic 

10 assays, include, but are not limited to: corin (accession nos. API 33845 
and AB013874; see, Yan et al. (1999) J. Biol, Chem. 274:14926-14938; 
Tomia et al. (1998) J. Biochem. Uan et al. (2000) Proc. 

Natl. Acad, Sci. U.S.A. 57:8525-8529; SEQ ID Nos. 61 and 62 for the 
human protein); enterpeptidase (also designated enterokinase; accession 

15 no. U09860 for the human protein; see, Kitamoto et al. (1995) Biochem. 
27: 4562-4568; Yahagi eta/. (1996) Biochem. Biophys. Res. Commun. 
275:806-812; Kitamoto et al. (1994) Proc. Natl. Acad. Sci. U.S.A. 
57:7588-7592; Matsushima a/. (1994) J. Biol, Chem. 255:19976- 
19982; see SEQ ID Nos. 63 and 64 for the human protein); human airway 

20 trypsin-like protease (HAT; accession no. AB002134; see Yamaoka et al. 
J. Biol. Chem. 275:1 1 894-1 1 901 ; SEQ ID Nos, 65 and 66 for the human 
protein); hepsin (see, accession nos. Ml 8930, AF030065, X70900; 
Leytus et al. (1988) Biochem. 27: 11895-11901; Vu et al. (1997) J. 
Biol. Chem. 272:31315-31320; and Farley et al. (1993) Biochem. 

25 Biophys. Acta 7 7 75:350-352; SEQ ID Nos. 67 and 68 for the human 

protein); TMPRS2 (see. Accession Nos. U75329 and AP1 13596; Paoloni- 
Giacobino et al. (1997) Genomics 44:309-320; and Jacquinet et aL 
(2000) FEBS Lett. 468: 93-100; SEQ ID Nos. 69 and 70 for the human 
protein) TMPRSS4 (see. Accession No. NM 016425; Wallrapp et al. 

30 (2000) Cancer 50:2602-2606; SEQ ID Nos. 71 and 72 for the human 

protein); and TADG-12 (also designated MTSP6, see SEQ ID Nos. 1 1 and 
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12; see International PCT application No. WO 00/52044, which claims 
priority to U.S. application Serial No. 09/261,416). 

Also provided are muteins of the single chain protease domains and 
MTSPs, particularly muteins in which the Cys residue in the protease 
5 domain that is free (i.e., does not form disulfide linkages with any other 
Cys residue in the protein) is substituted with another amino acid 
substitution, preferably with a conservative amino acid substitution or a 
substition that does not eliminate the activity, and muteins in which a 
glycosylation site(s) is eliminated. Muteins in which other conservative 
10 amino acid substitutions in which catalytic activity is retained are also 

contemplated (see, e.g., Table 1, for exemplary amino acid substitutions). 
See, also. Figure 4, which identifies the free Cys residues in MTSPS, 
MTSP4 and MTSP6. 

Hence, provided herein are members of a family of transmembrane 
15 serine protease (MTSP) proteins, and functional domains, especially 

protease {or catalytic) domains thereof, muteins and other derivatives and 
analogs thereof. Also provided herein are nucleic acids encoding the 
MTSPs. 

Exemplary MTSPs (see, e.g., SEQ ID No. 1-12, 49 and 50) are 
20 provided herein, as are the single chain protease domains thereof as 
follows: SEQ ID Nos. 1, 2, 49 and 50 set forth amino acid and nucleic 
acid sequences of MTSP1 and the protease domain thereof; SEQ ID No. 3 
sets forth the MTSPS nucleic acid sequence and SEQ ID No. 4 the 
encoded MTSPS amino acids; SEQ ID No. 5 MTSP4 a nucleic acid 
25 sequence of the protease domain and SEQ ID No. 6 the encoded MTSP4 
amino acid protease domain; SEQ ID No. 7 MTSP4-L a nucleic acid 
sequence and SEQ ID No. 8 the encoded MTSP4-L amino acid sequence; 
SEQ ID No. 9 an MTSP4-S encoding nucleic acid sequence and SEQ ID 
No. 10 the encoded MTSP4-S amino acid sequence; and SEQ ID No. 1 1 
30 an MTSP6 encoding nucleic acid sequence and SEQ ID No. 12 the 
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encoded MTSP6 amino acid sequence. The single chain protease 
domains of each are delineated below. 

Nucleic acid molecules that encode a single-chain protease domain 
or catalytically active portion thereof are provided. Also provided are 
5 nucleic acid molecules that hybridize to such MTSP encoding nucleic acid 
along their full length and encode the protease domain or portion thereof 
are provided. Hybridization is preferably effected under conditions of at 
least low, generally at least moderate, and often high stringency. 

Additionally provided herein are antibodies that specifically bind to 

10 the MTSPs, cells, combinations, kits and articles of manufacture that 
contain the nucleic acid encoding the MTSP and/or the MTSP. Further 
provided herein are prognostic, diagnostic, therapeutic screening methods 
using MTSPs and the nucleic acids encoding MTSP. Also provided are 
transgenic non-human animals bearing inactivated genes encoding the 

15 MTSP and bearing the genes encoding the MTSP under non-native 

promotor control are provided. Such animals are useful in animal models 
of tumor initation, growth and/or progression models. 

Provided herein are members of a family of membrane serine 
proteases (MTSP) that are expressed in certain tumor or cancer cells such 

20 lung, prostate, colon and breast cancers. In particular, it is shown herein, 
that MTSPs, particularly, MTSPS, MTSP4 and MTSP6 are expressed in 
lung carcinoma, breast carcinoma, colon adenocarcinoma and/or ovarian 
carcinomas as well as in certain normal cells and tissues (see e.g., 
EXAMPLES for tissue-specific expression profiles of each protein 

25 exemplified herein). The MTSPs that are of particular interest herein, are 
those that are expressed in tumor cells, for example, those that appear to 
be expressed at different levels in tumor cells from normal cells, or whose 
functional activity is different in tumor cells from normal cells, such as by 
an alteration in a substrate therefor, or a cofactor. Hence the MTSP 

30 provided herein can serve as diagnostic markers for certain tumors. The 
level of activated MTSPS, MTSP4 and MTSP6 can be diagnostic of 
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prostate cancer. In addition, MTSP4 is expressed and/or activated in 
lymphomas, leukemias, lung caner, breast, prostrate and co\on cancers. 
MTSPe is activated and/or expressed in breast, lung, prostate, colon and 
ovarian cancers. Furthermore, compounds that modulate the activity of 
5 these MTSPs, as assessed by the assays provided herein, particularly the 
in vitro proteolytic assays that use the single chain protease domains, are 
potential therapeutic candidates for treatment of various malignancies and 
neoplastic disease. 

Also provided herein are methods of modulating the activity of the 

10 MTSPs and screening for compounds that modulate, including inhibit, 
antagonize, agonize or otherwise alter the activity of the MTSPs. Of 
particular interest is the extracellular domain of these MTSPs that includes 
the proteolytic (catalytic) portion of the protein. 

MTSP proteins, including, but not limited to, MTSPS, MTSP4, and 

15 MTSP6, including splice variants thereof, and nucleic acids encoding 

MTSPs, and domains, derivatives and analogs thereof are provided herein. 
Single chain protease domains, in the N-terminal is that which would be 
generated by activation of the zymogen, from any MTSP, particularly 
those that are not expressed in endothelial cells and that are expressed in 

20 tumor cells are also provided. 

Antibodies that specifically bind to the MTSP, particularly the single 
chain protease domain, and any and all forms of MTSPS and MTSP4, and 
cells, combinations, kits and articles of manufacture containing the MTSP 
proteins, domains thereof, or encoding nucleic acids are also provided 

25 herein. Transgenic non-human animals bearing inactivated genes 

encoding the MTSP and bearing the genes encoding the MTSP under a 
non-native promoter control are additionally provided herein. Also 
provided are nucleic acid molecules encoding each of the MTSPs and 
domains thereof. 

30 Also provided are plasmids containing any of the nucleic acid 

molecules provided herein. Cells containing the plasmids are also 
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provided. Such cells include, but are not limited to, bacterial cells, yeast 
cells, fungal cells, plant cells, insect cells and animal cells. 

Also provided is a method of producing a MTSP by growing the 
above-described cells under conditions whereby the MTSP is expressed 
by the cells, and recovering the expressed MTSP protein. Methods for 
isolating nucleic acid encoding other MTSPs are also provided. 

Also provided are cells, preferably eukaryotic cells, such as 
mammalian cells and yeast cells, in which the MTSP protein, preferably 
MTSP3 and MTSP4, is expressed in the surface of the cells. Such cells 
are used in drug screening assays to identify compounds that modulate 
the activity of the MTSP protein. These assays including in vitro binding 
assays, and transcription based assays in which signal transduction 
mediated by the MTSP is assessed. 

Further provided herein are prognostic, diagnostic and therapeutic 
screening methods using the MTSP and the nucleic acids encoding MTSP. 
In particular, the prognostic, diagnostic and therapeutic screening 
methods are used for preventing^ treating, or for finding agents useful in 
preventing or treating, tumors or cancers such as lung carcinoma, colon 
adenocarcinoma and ovarian carcinoma. 

Also provided are methods for screening for compounds that 
modulate the activity of any MTSP. The compounds are identified by 
contacting them with the MTSP and a substrate for the MTSP. A change 
in the amount of substrate cleaved in the presence of the compounds 
compared to that in the absence of the compound indicates that the 
compound modulates the activity of the MTSP. Such compounds are 
selected for further analyses or for use to modulate the activity of the 
MTSP, such as inhibitors or agonists- The compounds can also be 
identified by contacting the substrates with a cell that expresses the 
MTSP or the extracellular domain or proteolytically active portion thereof. 
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For assays in which the extracellular domain or a proteolytically active 
portion thereof is employed, the MTSP is any MTSP that is expressed on 
cells, other than endothelial cells, including, but not limited to MTSP1, 
MTSP3, 1\/1TSP4 and MTSP6. 
5 Also provided herein are modulators of the activity of the MTSP, 

especially the modulators obtained according to the screening methods 
provide herein. Such modulators may have use in treating cancerous 
conditions, and other neoplastic conditions. 

Pharmaceutical composition containing the protease domains of an 
10 MTSP protein, and the MTSP proteins, MTSP3, MTSP4 and MTSP6 are 
provided herein in a pharmaceutically acceptable carrier or excipient are 
provided herein. 

Also provided are articles of manufacture that contain the MTSP 
proteins and protease domains of MTSPs in single chain form. The 

15 articles contain a) packaging material; b) the polypeptide (or encoding 
nucleic acid), particularly the single chain protease domain thereof; and 
c) a label indicating that the article is for using ins assays for identifying 
modulators of the activities of an MTSP protein is provided herein. 

Conjugates containing a) a MTSP protease domain in single chain 

20 from; and b) a targeting agent linked to the MTSP directly or via a linker, 
wherein the agent facilitates: i) affinity isolation or purification of the 
conjugate; ii) attachment of the conjugate to a surface; iii) detection of 
the conjugate; or iv) targeted delivery to a selected tissue or cell, is 
provided herein. The conjugate can contain a plurality of agents linked 

25 thereto. The conjugate can be a chemical conjugate; and it can be a 
fusion protein. 

In yet another embodiment, the targeting agent is a protein or 
peptide fragment. The protein or peptide fragment can include a protein 
binding sequence, a nucleic acid binding sequence, a lipid binding 
30 sequence, a polysaccharide binding sequence, or a metal binding 
sequence. 
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Method of diagnosing a disease or disorder characterized by 
detecting an aberrant level of an MTSP, particularly an MTSP3, MTSP4 or 
MTSP 6, in a subject is provided. The method can be practiced by 
measuring the level of the DNA, RNA, protein or functional activity of the 
5 MTSP. An increase or decrease in the level of the DNA, RNA, protein or ^ 
functional activity of the MTSP, relative to the level of the DNA, RNA, 
protein or functional activity found in an analogous sample not having the 
disease or disorder (or other suitable control) is indicative of the presence 
of the disease or disorder in the subject or other relative any other 

10 suitable control. 

Combinations are provided herein. The combination can include: 
a) an inhibitor of the activity of an MTSP; and b) an anti-cancer treatment 
or agent. The MTSP inhibitor and the anti-cancer agent can be 
formulated in a single pharmaceutical composition or each is formulated in 

15 a separate pharmaceutical composition. The MTSP inhibitor can be an 
antibody or a fragment or binding portion thereof against the MTSP, such 
as an antibody that specifically binds to the protease domain, an inhibitor 
of the MTSP production, or an inhibitor of the MTSP membrane- 
localization or an inhibitor of MTSP activation. Other MTSP inhibitors 

20 include, but are not limited to, an antisense nucleic acid encoding the 
MTSP, particularly a portion of the protease domain; a nucleic acid 
encoding at least a portion of a gene encoding the MTSP with a 
heterologous nucleotide sequence inserted therein such that the 
heterologous sequence inactivates the biological activity encoded MTSP 

25 or the gene encoding it. The portion of the gene encoding the MTSP 
preferably flanks the heterologous sequence to promote homologous 
recombination with a genomic gene encoding the MTSP. 

Also, provided are methods for treating or preventing a tumor or 
cancer in a mammal by administering to a mammal an effective amount of 

30 an inhibitor of an MTSP3, MTSP4 or MTSP6, whereby the tumor or 

cancer is treated or prevented. The MTSP inhibitor used in the treatment 
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or for prophylaxis is administered with a pharmaceutically acceptable 
carrier or excipient. The mammal treated can be a human. The treatment 
or prevention method can additionally include administering an anti-cancer 
treatment or agent simultaneously with or subsequently or before 
5 administration of the MTSP inhibitor. 

Also provided is a recombinant non-human animal in which an 
endogenous gene of an MTSP has been deleted or inactivated by 
homologous recombination or insertional mutagenesis of the animal or an 
ancestor thereof. A recombinant non-human animal is provided herein, 

10 where the gene of an MTSP is under control of a promoter that is not the 
native promoter of the gene or that is not the native promoter of the gene 
in the non-human animal or where the nucleic acid encoding the MTSP is 
heterologous to the non-human animal and the promoter is the native or a 
non-native promoter. 

15 Also provided are methods of treatments of tumors by adminis- 

tering a prodrug that is activated by an MTSP that is expressed or active 
in tumor cells, particularly those in which its functional activity in tumor 
cells is greater than in none-tumor cells. The prodrug is administered and, 
upon administration, active MTSP expressed on cells cleaves the prodrug 

20 and releases active drug in the vicinity of these cells. The active anti- 
cancer drug accumulates in the vicinity of the tumor. This is particularly 
useful in instances in whtih an MTSP is expressed or active in greater 
quantity, higher level or predominantly in tumor cells compared to other 
cells. 

25 BRIEF DESCRIPTION OF DRAWINGS 

Figure 1 illustrates the domain organization of the MTSP3; 

Figure 2 illustrates the domain organization of the MTSP4 splice 
variants and domains thereof; MTSP4-L includes a transmembrane 
domain, a CUB domain, a tow density lipoprotein receptor (LDLR) 
30 domains, and a serine protease catalytic domain; MTSP4-S lacking the 
portion between amino acids 136-279. 
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Figure 3 depicts the domain organization of MTSP6. 

Figure 4 provides an alignment of the C-terminal portions of 
MTSP3, the two splice variant-encoded forms of MTSP4, and MTSP6, 
that encompasses the protease domains thereof; the figure shows the 
5 cleavage sites, which form the N-terminus of the protease domain of each 
protein; a potential glycosylation site is noted and the free Cys residues in 
the protease domain of each are noted (*). Muteins of each protein may 
be prepared by replacing the residues in the glycosylation site, particularly 
the N residue, and the free Cys residues, with preferably conservative 
10 amino acid residues. Such muteins are also provided herein. 
DETAILED DESCRIPTION OF THE INVENTION 
A. DEFINITIONS 

Unless defined otherwise, all technical and scientific terms used 
herein have the same meaning as is commonly understood by one of 
15 ordinary skill in the art to which this invention belongs. All patents, 

applications, published applications and other publications and sequences 
from GenBank and other data bases referred to herein are incorporated by 
reference in their entirety. 

As used herein, the abbreviations for any protective groups, amino 
20 acids and other compounds, are, unless indicated otherwise, in accord 
with their common usage, recognized abbreviations, or the lUPAC-lUB 
Commission on Biochemical Nomenclature {see, (1972) Biochem. 
11:942-944). 

As used herein, serine protease refers to a diverse family of 
25 proteases wherein a serine residue is involved in the hydrolysis of proteins 
or peptides. The serine residue can be part of the catalytic triad 
mechanism, which includes a serine, a histidine and an aspartic acid in 
the catalysis, or be part of the hydroxyl/e-amine or hydroxyl/a-amine 
catalytic dyad mechanism, which involves a serine and a lysine in the 
30 catalysis. 
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As used herein, "transmembrane serine protease (MTSP)" refers to 
a family of transmembrane serine proteases that share common structural 
features as described herein (see, also Hooper et aL (2001) J. Biol. 
Chem,276:8S7-860), Thus, reference, for example, to "MTSP" 
5 encompasses all proteins encoded by the MTSP gene family, including but 
are not limited to: MTSP1, MTSP3, MTSP4 and MTSP6, or an equivalent 
molecule obtained from any other source or that has been prepared 
synthetically or that exhibits the same activity. Other MTSPs include, but 
are not limited to, corin, enterpeptidase, human airway trypsin-like 

10 protease (HAT), MTSP1, TMPRSS2, and TMPRSS4. Sequences of 

encoding nucleic molecules and the encoded amino acid sequences of 
exemplary MTSPs and/or domains thereof are set forth in SEQ ID Nos, 1- 
12, 49, 50 and 61-72. The term also encompass MTSPs with 
conservative amino acid substitutions that do not substantially alter 

15 activity of each member, and also encompasses splice variants thereof. 
Suitable conservative substitutions of amino acids are known to those of 
skill in this art and may be made generally without altering the biological 
activity of the resulting molecule. Of particular interest are MTSPs of 
mammalian, including human, origin. Those of skill in this art recognize 

20 that, in general, single amino acid substitutions in non-essential regions of 
a polypeptide do not substantially alter biological activity (see, e.g., 
Watson et aL Molecular Biology of the Gene, 4th Edition, 1 987, The 
Bejacmin/Cummings Pub. co., p. 224). 

As used herein, a "protease domain of an MTSP" refers to the 

25 protease domain of MTSP that is located within the extracellular domain 
of a MTSP and exhibits serine proteolytic activity. It includes at least the 
smallest fragment thereof that acts catalytically as a single chain form. 
Hence it is at least the minimal portion of the extracellular domain that 
exhibits proteolytic activity as assessed by standard assays in vitro 

30 assays. Those of skill in this art recognize that such protease domain is 
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the portion of the protease that is structurally equivalent to the trypsin or 
chymotrypsin fold. 

Exemplary MTSP proteins, with the protease domains indicated, are 
illustrated in Figures 1-3, Smaller portions thereof that retain protease 
5 activity are contemplated. The protease domains vary in size and 
constitution, including insertions and deletions in surface loops. They 
retain conserved structure, including at least one of the active site triad, 
primary specificity pocket, oxyanion hole and/or other features of serine 
protease domains of proteases. Thus, for purposes herein, the protease 

10 domain is a portion of a MTSP, as defined herein, and is homologous to a 
domain of other MTSPs, such as corin, enterpeptidase, human airway 
trypsin-like protease (HAT), MTSP1, TMPRSS2, and TMPRSS4, which 
have been previously identified; it was not recognized, however, that an 
isolated single chain form of the protease domain could function 

15 proteolytically in //? vitro assays. As with the larger class of enzymnes of 
the chymotrypsin (SI) fold (see, e.g., Internet accessible MEROPS data 
base), the MTSPs protease domains share a high degree of amino acid 
sequence identity. The His, Asp and Ser residues necessary for activity 
are present in conserved motifs. The activation site, which results in the 

20 N-terminus of second chain in the two chain forms is has a conserved 
motif and readily can be identified (see, e.g., amino acids 801-806, SEQ 
ID No. 62, amino acids 406-410, SEQ ID No. 64; amino acids 186-190, 
SEQ ID No. 66; amino acids 161-166, SEQ ID No. 68; amino acids 255- 
259, SEQ ID No. 70; amino acids 190-194, SEQ ID No. 72). 

25 As used herein, the catalytically active domain of an MTSP refers 

to the protease domain. Reference to the protease domain of an MTSP 
refers includes the single and double-chain forms of any of these proteins. 
The zymogen form of each protein is single chain form, which can be 
converted to the active two chain form by cleavage. The protease 

30 domain may also be converted to a two chain form. By active form is 
meant a form active in vivo. 
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Significantly, it is shown herein, that, at least in vitro, the single 
chain forms of the MTSPs and the catalytic donnains or proteolytically 
active portions thereof (typically C-terminal trucnations) thereof exhibit 
protease activity. Hence provided herein are isolated single chain forms of 
5 the protease domains of MTSPs and their use in in vitro drug screening 
assays for identification of agents that modulate the activity thereof. 

As used herein an MTSPS, whenever referenced herein, includes 
at least one or all of or any combination of: 

a polypeptide encoded by the sequence of nucleotides set 
10 forth in SEQ ID No. 3; 

a polypeptide encoded by a sequence of nucleotides that 
hybridizes under conditions of low, moderate or high stringency to the 
sequence of nucleotides set forth in SEQ ID No. 3; 

a polypeptide that comprises the sequence of amino acids 
15 set forth as amino acids 205-437 of SEQ ID No. 4; 

a polypetide that comprises a sequence of amino acids 
having at least about 85% or 90% sequence identity with the sequence 
of amino acids set forth in SEQ ID No. 4; and/or 

a splice variant of the MTSP3 set forth in SEQ ID Nos. 3 

20 and 4. 

The MTSPS may be from any animal, particularly a mammal, and 
includes but are not limited to, humans, rodents, fowl, ruminants and 
other animals. The full length zymogen or double chain activated form is 
contemplated or any domain thereof, including the protease domain, 
25 which can be a double chain activated form, or a single chain form. 

As used herein an MTSP4, whenever referenced herein, includes 
at least one or all of or any combination of: 

a polypeptide encoded by the sequence of nucleotides set 
forth in any of SEQ ID No. 5, 7 or 9; 
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a polypeptide encoded by a sequence of nucleotides that 
hybridizes under conditions of low, moderate or high stringency to the 
sequence of nucleotides set forth in any of SEQ ID Nos. 5, 7 or 9; 

a polypeptide that comprises the sequence of amino acids 
set forth in any of SEQ ID Nos. 6, 8 or 10; 

a polypetide that comprises a sequence of amino acids 
having at least about 85% or 90% or 95% sequence identity with the 
sequence of amino acids set forth in SEQ ID No. 6, 8 or 10; and/or 

a splice variant of the MTSP4s set forth in SEQ ID Nos. 

7-10. 

The MTSP4 may be from any animal, particularly a mammal, and 
includes but are not limited to, humans, rodents, fowl, ruminants and 
other animals. The full length zymogen or double chain activated form is 
contemplated or any domain thereof, including the protease domain, 
which can be a double chain activated form, or a single chain form. 

As used herein an MTSP6, whenever referenced herein, includes at 
least one or all of or any combination of: 

a polypeptide encoded by the sequence of nucleotides set 
forth in any of SEQ ID No. 11; 

a polypeptide encoded by a sequence of nucleotides that 
hybridizes under conditions of low, moderate or high stringency to the 
sequence of nucleotides set forth in any of SEQ ID Nos. 1 1 ; 

a polypeptide that comprises the sequence of amino acids 
set forth in any of SEQ ID Nos. 12; 

a polypetide that comprises a sequence of amino acids 
having at least about 90% or 95% or 98% sequence identity with the 
sequence of amino acids set forth in SEQ ID No. 12; and/or 

a splice variant of the MTSP4s set forth in SEQ ID No, 12. 
The MTSP6 may be from any animal, particularly a mammal, and includes 
but are not limited to, humans, rodents, fowl, ruminants and other 
animals. The full length zymogen or double chain activated form is 
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contemplated or any donnain thereof, including the protease domain, 
which can be a double chain activated form, or a single chain form. Of 
particular interest herein is the MTSP6 of SEQ ID No. 12. 

As used herein, a human protein is one encoded by DNA present in 
the genome of a human, including all allelic variants and conservative 
variations as long as they are not variants found in other mammals. 

As used herein, a "nucleic acid encoding a protease domain or 
catalytically active portion of a MTSP" shall be construed as referring to a 
nucleic acid encoding only the recited single chain protease domain or 
active portion thereof, and not the other contiguous portions of the MTSP 
as a continuous sequence. 

As used herein, a CUB domain is a motif that mediates protein- 
protein interactions in complement components Clr/Cls and has also 
been identified in various proteins involved in developmental processes. 

As used herein, catalytic activity refers to the activity of the MTSP 
as a serine proteases. Function of the MTSP refers to its function in 
tumor biology, including promotion of or involvement in tumorigenesis, 
metastasis or carcinogenesis, and also roles in signal transduction. 

As used herein, a "propeptide" or "pro sequence" is sequence of 
amino acids positioned at the amino terminus of a mature biologically 
active polypeptide. When so-positioned, the resulting polypeptide is 
called a zymogen. Zymogens, generally, are biologically inactive and can 
be converted to mature active polypeptides by catalytic or autocatalytic 
cleavage of the propeptide from the zymogen. A zymogens is an enzyma- 
tically inactive protein that is converted to a proteolytic enzyme by the 
action of an activator. Cleavage may be effected autocatalytically. 

As used herein, "disease or disorder" refers to a pathological 
condition in an organism resulting from, e.g., infection or genetic defect, 
and characterized by identifiable symptoms. 

As used herein, neoplasm (neoplasia) refers to abnormal new 
growth, and thus means the same as tumor, which may be benign or 
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malignant. Unlike hyperplasia, neoplastic proliferation persists even in the 
absence of the original stimulus. 

As used herein, neoplastic disease refers to any disorder involving 
cancer, including tumor development, growth, metastasis and 
5 progression. 

As used herein, cancer refers to a general term for diseases caused 
by any type of malignant tumor. 

As used herein, malignant, as applies to tumors, refers to primary 
tumors that have the capacity of metastasis with loss of growth control 

10 and positional control. 

As used herein, an anti-cancer agent (used interchangeable with 
"anti-tumor or anti-neoplastic agent") refers to any agents used in the 
anti-cancer treatment- These include any agents, when used alone or in 
combination with other compounds, that can alleviate, reduce, ameliorate, 

15 prevent, or place or maintain in a state of remission of clinical symptoms 
or diagnostic markers associated with neoplastic disease, tumor and 
cancer, and can be used in methods, combinations and compositions 
provided herein. Non-limiting examples of anti-neoplastic agents include 
anti-angiogenic agents, alkylating agents, antimetabolite, certain natural 

20 products, platinum coordination complexes, anthracenediones, substituted 
ureas, methylhydrazine derivatives, adrenocortical suppressants, certain 
hormones, antagonists and anti-cancer polysaccharides. 

As used herein, a splice variant refers to a variant produced by 
differential processing of a primary transcript of genomic DNA that results 

25 in more than one type of mRNA. Splice variants of MTSPs are provided 
herein. 

As used herein, angiogenesis is intended to broadly encompass the 
totality of processes directly or indirectly involved in the establishment 
and maintenance of new vasculature (neovascularization), including, but 
30 not limited to, neovascularization associated with tumors. 
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As used herein, anti-angiogenic treatment or agent refers to any 
therapeutic regimen and compound, when used alone or in combination 
with other treatment or compounds, that can alleviate, reduce, 
ameliorate, prevent, or place or maintain in a state of remission of clinical 
symptoms or diagnostic markers associated with undesired and/or 
uncontrolled angiogenesis. Thus, for purposes herein an anti-angiogenic 
agent refers to an agent that inhibits the establishment or maintenance of 
vasculature. Such agents include, but are not limited to, anti-tumor 
agents, and agents for treatments of other disorders associated with 
undesirable angiogenesis, such as diabetic retinopathies, restenosis, 
hyperproliferative disorders and others. 

As used herein, non-anti-angiogenic anti-tumor agents refer to anti- 
tumor agents that do not act primarily by inhibiting angiogenesis. 

As used herein, pro-angiogenic agents are agents that promote the 
establishment or maintenance of the vasculature. Such agents include 
agents for treating cardiovascular disorders, including heart attacks and 
strokes. 

As used herein, undesired and/or uncontrolled angiogenesis refers 
to pathological angiogenesis wherein the influence of angiogenesis 
stimulators outweighs the influence of angiogenesis inhibitors. As used 
herein, deficient angiogenesis refers to pathological angiogenesis 
associated with disorders where there is a defect in normal angiogenesis 
resulting in aberrant angiogenesis or an absence or substantial reduction 
in angiogenesis. 

As used herein, endotheliase refers to a mammalian protein, 
including humans, that has a transmembrane domain and is expressed on 
the surface of endothelial cells and includes a protease domain, 
particularly an extracellular protease domain, and is preferably a serine 
protease. Thus, reference, for example, to endotheliase encompasses all 
proteins encoded by the endotheliase gene family, or an equivalent 
molecule obtained from any other source or that has been prepared 
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synthetically or that exhibits the same activity. The endotheliase gene 
family are transmembrane proteases expressed in endothelial cells. 
Endotheliases are excluded from the MTSPs contemplated herein. 

As used herein, the protease domain of an endotheliase refers to 
5 the polypeptide portion of the endotheliase that is the extracellular portion 
that exhibits protease activity. The protease domain is a polypeptide that 
includes at least the minimum number of amino acids, generally more 
than 50 or 100, required for protease activity. Protease activity may be 
assessed empirically, such as by testing the polypeptide for its ability to 

10 act as a protease. Assays, such as in the assays described in the 
EXAMPLES, except employing a known substrate in place of the test 
compounds as described in the Examples may be used. Furthermore, 
since proteases, particularly serine proteases, have characteristic 
structures and sequences or motifs, the protease domain may be readily 

15 identified by such structure and sequence or motif. 

As used herein, the protease domain of an MTSP protein refers to 
the protease domain of an MTSP that is located within or is the 
extracellular domain of an MTSP and exhibits serine proteolytic activity. 
Hence it is at least the minimal portion of the extracellular domain that 

20 exhibits proteolytic activity as assessed by standard assays in vitro. It 
refers, herein, to a single chain form heretofore thought to be inactive. 
Exemplary protease domains include at least a sufficient portion of 
sequences of amino acids set forth as amino acids 615-855 in SEQ ID 
No. 2 (encoded by nucleotides 1865-2587 in SEQ ID No. 1; see also SEQ 

25 ID Nos. 49 and 50) from MTSP1 , amino acids 205-437 of SEQ ID NO. 4 
from MTSPS, SEQ ID No. 6, which sets forth the protease domain of 
MTSP4, and amino acids 217-443 of SEQ ID No. 1 1 from MTSP6. Also 
contemplated are nucleic acid molecules that encode polypeptide that has 
proteolytic activity in an in vitro proteolysis assay and that have at least 

30 80%, 85%, 90% or 95% sequence identity with the full length of a 
protease domain of an MTSP protein, or that hybridize along their full 
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length to a nucleic acids that encode a protease domain, particularly 
under conditions of moderate, generally high, stringency. 

For each of these protease domains, residues at the N-terminus can 
be critical for activity, since it has been shown that an Asp in the N- 
5 terminus of such proteases is essential for formation of the catalytically 
active conformation upon activation cleavage of the zymogen form of the 
protease. It is shown herein that the protease domain of the singles chain 
form of the protease is catalytically active. Hence the protease domain 
will require the N-terminal amino acids; the c-terminus portion may be 

10 truncated. The amount that can be removed can be determined 

empirically by testing the protein for protease activity in an in vitro assays 
that assesses catalytic cleavage. 

Hence smaller portions of the protease domains, particularly the 
single chain domains, thereof that retain protease activity are 

15 contemplated. Such smaller versions will generally be C-terminal 

truncated versions of the protease domains. The protease domains vary 
in size and constitution, including insertions and deletions in surface 
loops. Such domains exhibit conserved structure, including at least one 
structural feature, such as the active site triad, primary specificity pocket, 

20 oxyanion hole and/or other features of serine protease domains of 

proteases. Thus, for purposes herein, the protease domain is a single 
chain portion of an MTSP, as defined herein, but is homologous in its 
structural features and retention of sequence of similarity or homology the 
protease domain of chymotrypsin or trypsin. Most significantly, the 

25 polypeptide will exhibit proteolytic activity as a single chain. 

As used herein, by homologous means about greater than 25% 
nucleic acid sequence identity, preferably 25% 40%, 60%, 80%, 90% or 
95%. The terms "homology" and "identity" are often used interchange- 
ably. In general, sequences are aligned so that the highest order match is 

30 obtained (see, e.g.\ Computational l\/lolecular Biology , Lesk, A.M., ed., 

Oxford University Press, New York, 1988; Biocomputing: Informatics and 
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Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; 
Computer Analysis of Sequence Data, Part I, Griffin, A.M., and Griffin, 
H.G., eds., Humana Press, New Jersey, 1994; Sequence Analysis in 
Molecular Biology, von Heinje, G., Academic Press, 1987; and Sequence 
5 Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, 
New York, 1991; Carillo et aL (1988) SIAMJ Applied Math 48-A013). 

By sequence identity, the number of conserved amino acids are 
determined by standard alignment algorithms programs, and are used with 
default gap penalties established by each supplier. Substantially 
10 homologous nucleic acid molecules would hybridize typically at moderate 
stringency or at high stringency all along the length of the nucleic acid of 
interest. Also contemplated are nucleic acid molecules that contain 
degenerate codons in place of codons in the hybridizing nucleic acid 
molecule. 

15 Whether any two nucleic acid molecules have nucleotide sequences 

that are at least 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% 
"identical" can be determined using known computer algorithms such as 
the "FAST A" program, using for example, the default parameters as in 
Pearson et al. (1988) Proc, Natl. Acad. ScL USA 55:2444 (other 

20 programs include the GCG program package (Devereux, J., et aL, Nucleic 
Acids Research 72(l):387 (1984)), BLASTP, BLASTN, FASTA (Atschul, 
S.F., etal., J Molec Biol 215:403 (1990); Guide to Huge Computers, 
Martin J. Bishop, ed.. Academic Press, San Diego, 1994, and Carillo et al. 
(1988) SIAM J Applied Math 4S: 1073). For example, the BLAST function 

25 of the National Center for Biotechnology Information database may be 
used to determine identity. Other commercially or publicly available 
programs include, DNAStar "MegAlign" program (Madison, Wl) and the 
University of Wisconsin Genetics Computer Group (UWG) "Gap" program 
(Madison Wl)}. Percent homology or identity of proteins and/or nucleic 

30 acid moleucles may be determined, for example, by comparing sequence 
information using a GAP computer program (e.g., Needleman et al. 
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(1970) J. Mol. Biol. 48:443, as revised by Smith and Waterman ((1981) 
Adv. Appl. Math. 2:482). Briefly, the GAP program defines similarity as 
the number of aligned symbols (i.e., nucleotides or amino acids) which 
are similar, divided by the total number of symbols in the shorter of the 
5 two sequences. Default parameters for the GAP program may include: 
(1) a unary comparison matrix (containing a value of 1 for identities and 0 
for non-identities) and the weighted comparison matrix of Gribskov et al. 
(1986) Nucl. Acids Res. 14:6745, as described by Schwartz and Dayhoff, 
eds., ATLAS OF PROTEIN SEQUENCE AND STRUCTURE, National 

10 Biomedical Research Foundation, pp. 353-358 (1979); (2) a penalty of 
3.0 for each gap and an additional 0.10 penalty for each symbol in each 
gap; and (3) no penalty for end gaps. 

Therefore, as used herein, the term "identity" represents a 
comparison between a test and a reference polypeptide or polynucleotide. 

15 For example, a test polypeptide may be defined as any polypeptide that 
is 90% or more identical to a reference polypeptide. As used herein, the 
term at least "90% identical to" refers to percent identities from 90 to 
99.99 relative to the reference polypeptides. Identity at a level of 90% or 
more is indicative of the fact that, assuming for exemplification purposes 

20 a test and reference polynucleotide length of 100 amino acids are 

compared. No more than 10% (i.e., 10 out of 100) amino acids in the 
test polypeptide differs from that of the reference polypeptides. Similar 
comparisons may be made between a test and reference polynucleotides. 
Such differences may be represented as point mutations randomly 

25 distributed over the entire length of an amino acid sequence or they may 
be clustered in one or more locations of varying length up to the 
maximum allowable, e.g. 10/100 amino acid difference (approximately 
90% identity). Differences are defined as nucleic acid or amino acid 
substitutions, or deletions. At level of homologies or identities above 

30 about 85-90%, the result should be independent of the program and gap 
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parameters set; such high levels of identity readily can be assess, often 
without relying on software. 

As used herein, primer refers to an oligonucleotide containing two 
or more deoxyribonucleotides or ribonucleotides, preferably more than 
three, from which synthesis of a primer extension product can be 
initiated. Experimental conditions conducive to synthesis include the 
presence of nucleoside triphosphates and an agent for polymerization and 
extension, such as DNA polymerase, and a suitable buffer, temperature 
and pH. 

As used herein, animals include any animal, such as, but are not 
limited to, goats, cows, deer, sheep, rodents, pigs and humans. Non- 
human animals, exclude humans as the contemplated animal. The MTSPs 
provided herein are from any source, animal, plant, prokaryotic and 
fungal. Preferred MTSPs are of animal origin, preferably mammalian 
origin. 

As used herein, genetic therapy involves the transfer of 
heterologous DNA to the certain cells, target cells, of a mammal, 
particularly a human, with a disorder or conditions for which such therapy 
is sought. The DNA is introduced into the selected target cells in a 
manner such that the heterologous DNA is expressed and a therapeutic 
product encoded thereby is produced. Alternatively, the heterologous 
DNA may in some manner mediate expression of DNA that encodes the 
therapeutic product, or it may encode a product, such as a peptide or 
RNA that in some manner mediates, directly or indirectly, expression of a 
therapeutic product. Genetic therapy may also be used to deliver nucleic 
acid encoding a gene product that replaces a defective gene or 
supplements a gene product produced by the mammal or the cell in which 
it is introduced. The introduced nucleic acid may encode a therapeutic 
compound, such as a growth factor inhibitor thereof, or a tumor necrosis 
factor or inhibitor thereof, such as a receptor therefor, that is not normally 
produced in the mammalian host or that is not produced in therapeutically 
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effective amounts or at a therapeutically useful time. The heterologous 
DNA encoding the therapeutic product may be modified prior to 
introduction into the cells of the afflicted host in order to enhance or 
otherwise alter the product or expression thereof. Genetic therapy may 
5 also involve delivery of an inhibitor or repressor or other modulator of 
gene expression. 

As used herein, heterologous DNA is DNA that encodes RNA and 
proteins that are not normally produced in vivo by the cell in which it is 
expressed or that mediates or encodes mediators that alter expression of 

10 endogenous DNA by affecting transcription, translation, or other 
regulatable biochemical processes. Heterologous DNA may also be 
referred to as foreign DNA. Any DNA that one of skill in the art would 
recognize or consider as heterologous or foreign to the cell in which is 
expressed is herein encompassed by heterologous DNA. Examples of 

15 heterologous DNA include, but are not limited to, DNA that encodes 

traceable marker proteins, such as a protein that confers drug resistance, 
DNA that encodes therapeutically effective substances, such as anti- 
cancer agents, enzymes and hormones, and DNA that encodes other 
types of proteins, such as antibodies. Antibodies that are encoded by 

20 heterologous DNA may be secreted or expressed on the surface of the 
cell in which the heterologous DNA has been introduced. 

Hence, herein heterologous DNA or foreign DNA, includes a DNA 
molecule not present in the exact orientation and position as the 
counterpart DNA molecule found in the genome. It may also refer to a 

25 DNA molecule from another organism or species (i.e., exogenous). 

As used herein, a therapeutically effective product is a product that 
is encoded by heterologous nucleic acid, typically DNA, that, upon 
introduction of the nucleic acid into a host, a product is expressed that 
ameliorates or eliminates the symptoms, manifestations of an inherited or 

30 acquired disease or that cures the disease. 
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As used herein, recitation that a polypeptide consists essentially of 
the protease domain means that the only MTSP portion of the polypeptide 
is a protease domain or a catalytically active portion thereof. The 
polypeptide may optionally, and generally will, include additional non- 
MTSP-derived sequences of amino acids. 

As used herein, cancer or tumor treatment or agent refers to any 
therapeutic regimen and/or compound that, when used alone or in 
combination with other treatments or compounds, can alleviate, reduce, 
ameliorate, prevent, or place or maintain in a state of remission of clinical 
symptoms or diagnostic markers associated with deficient angiogenesis. 

As used herein, domain refers to a portion of a molecule, e.g., 
proteins or nucleic acids, that is structurally and/or functionally distinct 
from other portions of the molecule. 

As used herein, protease refers to an enzyme catalyzing hydrolysis 
of proteins or peptides. For purposes herein, the protease domain is a 
single chain form of an MTSP protein. For MTSP3 and MTSP4 the 
protease domain also includes two chain forms. 

As used herein, catalytic activity refers to the activity of the MTSP 
as a protease as assessed in in vitro proteolytic assays that detect 
proteolysis of a selected substrate. 

As used herein, nucleic acids include DNA, RNA and aniages 
thereof, including protein nucleic acids (PNA) and mixture thereof. 
Nucleic acids can be single or double stranded. When referring to probes 
or primers, optionally labeled, with a detectable label, such as a 
fluorescent or radiolabel, single-stranded molecules are contemplated. 
Such molecules are typically of a length such that they are statistically 
unique of low copy number (typically less than 5, preferably less than 3) 
for probing or priming a library. Generally a probe or primer contains at 
least 14, 16 or 30 contiguous of sequence complementary to or identical 
a gene of interest. Probes and primers can be 10, 20, 30, 50, 100 or 
more nucleic acids long. 
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As used herein, nucleic acid encoding a fragment or portion of an 
MTSP refers to a nucleic acid encoding only the recited fragment or 
portion of MTSP, and not the other contiguous portions of the MTSP. 

As used herein, heterologous or foreign DNA and RNA are used 
interchangeably and refer to DNA or RNA that does not occur naturally as 
part of the genome in which it is present or which is found in a location 
or locations in the genome that differ from that in which it occurs in 
nature. Heterologous nucleic acid is generally not endogenous to the cell 
into which it is introduced, but has been obtained from another cell or 
prepared synthetically. Generally, although not necessarily, such nucleic 
acid encodes RNA and proteins that are not normally produced by the cell 
in which it is expressed. Any DNA or RNA that one of skill in the art 
would recognize or consider as heterologous or foreign to the cell in 
which it is expressed is herein encompassed by heterologous DNA. 
Heterologous DNA and RNA may also encode RNA or proteins that 
mediate or alter expression of endogenous DNA by affecting transcription, 
translation, or other regulatable biochemical processes. 

As used herein, operative linkage of heterologous DNA to 
regulatory and effector sequences of nucleotides, such as promoters, 
enhancers, transcriptional and translational stop sites, and other signal 
sequences refers to the relationship between such DNA and such 
sequences of nucleotides. For example, operative linkage of heterologous 
DNA to a promoter refers to the physical relationship between the DNA 
and the promoter such that the transcription of such DNA is initiated from 
the promoter by an RNA polymerase that specifically recognizes, binds to 
and transcribes the DNA in reading frame. 

As used herein, a sequence complementary to at least a portion of 
an RNA, with reference to antisense oligonucleotides, means a sequence 
having sufficient complementarily to be able to hybridize with the RNA, 
preferably under moderate or high stringency conditions, forming a stable 
duplex; in the case of double-stranded MTSP antisense nucleic acids, a 
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single strand of the duplex DNA nnay thus be tested, or triplex formation 
may be assayed. The ability to hybridize depends on the degree of 
complementarily and the length of the antisense nucleic acid. Generally, 
the longer the hybridizing nucleic acid, the more base mismatches with a 
5 MTSP encoding RNA it can contain and still form a stable duplex (or 
triplex, as the case may be). One skilled in the art can ascertain a 
tolerable degree of mismatch by use of standard procedures to determine 
the melting point of the hybridized complex. 

For purposes herein, conservative amino acid substitutions may be 

10 made in any of MTSPs and protease domains thereof provided that the 
resulting protein exhibits protease activity. Conservative amino acid 
substitutions, such as those set forth in Table 1, are those that do not 
eliminate proteolytic activity. Suitable conservative substitutions of 
amino acids are known to those of skill in this art and may be made 

15 generally without altering the biological activity of the resulting molecule. 
Those of skill in this art recognize that, in general, single amino acid 
substitutions in non-essential regions of a polypeptide do not substantially 
alter biological activity (see, e.g. , Watson et aL Molecular Biology of the 
Gene, 4th Edition, 1987, The Bejacmin/Cummings Pub. co., p. 224). 

20 Also included within the definition, is the catalytically active fragment of 
an MTSP, particularly a single chain protease portion. Conservative 
amino acid substitutions are made, for example, in accordance with those 
set forth in TABLE 1 as follows: 
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TABLE 1 



VJIIUIIICll it^olUUt? 


OQiloSrVaXlVe SUDSXllUTIOn 


Ala (A) 


Gly; Ser, Abu 


Arg (R) 


Lys, orn 


Asn (N) 


Gin; His 


Cys (C) 


Ser 


Gin (Q) 


Asn 


Glu (E) 


Asp 


Gly (G) 


Ala; Pro 


His (H) 


Asn; Gin 


lie (1) 


Leu; Val; Met; NIe; Nva 


Leu (L) 


lie; Val; Met; NIe; Nv 


Lys (K) 


Arg; Gin; Glu 


Met (M) 


Leu; Tyr; He; NLe Val 


Ornitine 


Lys; Arg 


Phe (F) 


Met; Leu; Tyr 


Ser (S) 


Thr 


Thr (T) 


Ser 


Trp (W) 


Tyr 


Tyr (Y) 


Trp; Phe 


Val {V) 


lie; Leu; Met; NIe; Nv 



Other substitutions are also pernnissible and may be determined empirically or 
in accord with known conservative substitutions. 

As used herein, Abu is 2-aminobutyric acid; Orn is ornithine. 

As used herein, the amino acids, which occur in the various amino 
acid sequences appearing herein, are identified according to their well- 
known, three-letter or one-letter abbreviations. The nucleotides, which occur 
in the various DNA fragments, are designated with the standard single-letter 
designations used routinely in the art. 

As used herein, a splice variant refers to a variant produced by 
differential processing of a primary transcript of genomic DNA that results in 
more than one type of mRNA. 

As used herein, a probe or primer based on a nucleotide sequence 
disclosed he rein, includes at least 1 0, 1 4, preferably at least 1 6 or 30 or 1 00 
contiguous sequence of nucleotides of SEQ ID Nos. 1 , 3, 5, 7, 9 or 1 1 . 

As used herein, amelioration of the symptoms of a particular disorder 
by administration of a particular pharmaceutical composition refers to any 
lessening, whether permanent or temporary, lasting or transient that can be 
attributed to or associated with administration of the composition. 
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As used herein, antisense polynucleotides refer to synthetic sequences 
of nucleotide bases connplementary to nnRNA or the sense strand of double 
stranded DNA. Admixture of sense and antisense polynucleotides under 
appropriate conditions leads to the binding of the two molecules, or 
hybridization. When these polynucleotides bind to (hybridize with) mRNA, 
inhibition of protein synthesis (translation) occurs. When these 
polynucleotides bind to double stranded DNA, inhibition of RNA synthesis 
(transcription) occurs. The resulting inhibition of translation and/or 
transcription leads to an inhibition of the synthesis of the protein encoded by 
the sense strand. Antisense nucleic acid molecule typically contain a 
sufficient number of nucleotides to specifically bind to a target nucleic acid, 
generally at least 5 contiguous nucleotides, often at least 14 or 16 or 30 
contiguous nucleotides or modified nucleotides complementary to the coding 
portion of a nucleic acid molecule that encodes a gene of interest, for 
example, nucleic acid encoding a single chain protease domain of an MTSP. 

As used herein, an array refers to a collection of elements, such as 
antibodies, containing three or more members. An addressable array is one 
in which the members of the array are identifiable, typically by position on a 
solid phase support. Hence, in general the members of the array will be 
immobilized to discrete identifiable loci on the surface of a solid phase. 

As used herein, antibody refers to an immunoglobulin, whether natural 
or partially or wholly synthetically produced, including any derivative thereof 
that retains the specific binding ability the antibody. Hence antibody 
includes any protein having a binding domain that is homologous or 
substantially homologous to an immunoglobulin binding domain. Antibodies 
include members of any immunoglobulin claims, including IgG, IgM, IgA, IgD 
and IgE. 

As used herein, antibody fragment refers to any derivative of an 
antibody that is less then full length, retaining at least a portion of the full- 
length antibody's specific binding ability. Examples of antibody fragments 
include, but are not limited to. Fab, Fab', F(ab)2, single-chain Fvs (scFV), FV, 
dsFV diabody and Fd fragments. The fragment can include multiple chains 
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linked together, such as by disulfide bridges. An antibody fragment generally 
contains at least about 50 amino acids and typically at least 200 amino 
acids. 

As used herein, an Fv antibody fragment is composed of one variable 
5 heavy domain (Vh) and one variable light domain linked by noncovalent 
interactions. 

As used herein, a dsFV refers to an Fv with an engineered 
intermolecular disulfide bond, which stabilizes the Vh-Vl pair. 

As used herein, an F(ab)2 fragment is an antibody fragment that 
10 results from digestion of an immunoglobulin with pepsin at pH 4.0-4.5; it 
may be recombinantly produced. 

As used herein. Fab fragments is an antibody fragment that results 
from digestion of an immunoglobulin with papain; it may be recombinantly 
produced. 

15 As used herein, scFVs refer to antibody fragments that contain a 

variable light chain (VJ and variable heavy chain (Vh) covalently connected 
by a polypeptide linker in any order. The linker is of a length such that the 
two variable domains are bridged without substantial interference. Preferred 
linkers are (Gly-Ser)^ residues with some Glu or Lys residues dispersed 

20 throughout to increase solubility. 

As used herein, humanized antibodies refer to antibodies that are 
modified to include human sequences of amino acids so that administration 
to a human will not provoke an immune response. Methods for preparation 
of such antibodies are known. For example, the hybridoma that expresses 

25 the monoclonal antibody is altered by recombinant DNA techniques to 

express an antibody in which the amino acid composition of the non-variable 
regions is based on human antibodies. Computer programs have been 
designed to identify such regions. 

As used herein, diabodies are dimeric scFV; diabodies typically have 

30 shorter peptide linkers than scFvs, and they preferentially dimerize. 

As used herein, humanized antibodies refer to antibodies that are 
modified to include human sequences of amino acids so that administration 
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to a human will not provoke an innnnune response. Methods for preparation 
of such antibodies are known. For exannple, the hybridoma that expresses 
the monoclonal antibody is altered by recombinant DNA techniques to 
express an antibody in which the amino acid composition of the non-variable 
5 regions is based on human antibodies. Computer programs have been 
designed to identify such regions. 

As used herein, production by recombinant means by using 
recombinant DNA methods means the use of the well known methods of 
molecular biology for expressing proteins encoded by cloned DNA. 

10 As used herein the term assessing is intended to include 

quantitative and qualitative determination in the sense of obtaining an 
absolute value for the activity of an MTSP, or a domain thereof, present in 
the sample, and also of obtaining an index, ratio, percentage, visual or other 
value indicative of the level of the activity. Assessment may be direct or 

15 indirect and the chemical species actually detected need not of course be the 
proteolysis product itself but may for example be a derivative thereof or 
some further substance. 

As used herein, biological activity refers to the in vivo activities of a 
compound or physiological responses that result upon m vivo administration 

20 of a compound, composition or other mixture. Biological activity, thus, 
encompasses therapeutic effects and pharmaceutical activity of such 
compounds, compositions and mixtures. Biological activities may be 
observed in in vitro systems designed to test or use such activities. Thus, 
for purposes herein the biological activity of a luciferase is its oxygenase 

25 activity whereby, upon oxidation of a substrate, light is produced. 

As used herein, a combination refers to any association between two 
or among more items. 

As used herein, a composition refers to a any mixture. It may be a 
solution, a suspension, liquid, powder, a paste, aqueous, non-aqueous or any 

30 combination thereof. 
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As used herein, a conjugate refers to the compounds provided herein 
that include one or more MTSPs, particularly single chain protease domains 
thereof, and one or more targeting agents. These conjugates include those 
produced by recombinant means as fusion proteins, those produced by 
5 chemical means, such as by chemical coupling, through, for example, 
coupling to sulfhydryl groups, and those produced by any other method 
whereby at least one MTSP, or a domain thereof, is linked, directly or 
indirectly via linker{s) to a targeting agent. 

As used herein, a targeting agent, is any moiety, such as a protein or 
10 effective portion thereof, that provides specific binding of the conjugate to a 
celt surface receptor, which, preferably, internalizes the conjugate or MTSP 
portion thereof. A targeting agent may also be one that promotes or 
facilitates, for example, affinity isolation or purification of the conjugate; 
attachment of the conjugate to a surface; or detection of the conjugate or 
15 complexes containing the conjugate. 

As used herein, an antibody conjugate refers to a conjugate in which 
the targeting agent is an antibody. 

As used herein, humanized antibodies refer to antibodies that are 
modified to include human sequences of amino acids so that administration 
20 to a human will not provoke an immune response. Methods for preparation 
of such antibodies are known. For example, the hybridoma that expresses 
the monoclonal antibody is altered by recombinant DNA techniques to 
express an antibody in which the amino acid composition of the non-variable 
regions is based on human antibodies. Computer programs have been 
25 designed to identify such regions. 

As used herein, derivative or analog of a molecule refers to a portion 
derived from or a modified version of the molecule. 

As used herein, fluid refers to any composition that can flow. Fluids 
thus encompass compositions that are in the form of semi-solids, pastes, 
30 solutions, aqueous mixtures, gels, lotions, creams and other such 
compositions. 
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As used herein, an effective annount of a compound for treating a 
particular disease is an annount that is sufficient to ameliorate, or in some 
manner reduce the symptoms associated with the disease. Such amount 
may be administered as a single dosage or may be administered according to 
5 a regimen, whereby it is effective. The amount may cure the disease but, 
typically, is administered in order to ameliorate the symptoms of the disease. 
Repeated administration may be required to achieve the desired amelioration 
of symptoms. 

As used herein equivalent, when referring to two sequences of nucleic 
10 acids means that the two sequences in question encode the same sequence 
of amino acids or equivalent proteins. When equivalent is used in referring 
to two proteins or peptides, it means that the two proteins or peptides have 
substantially the same amino acid sequence with only conservative amino 
acid substitutions (see, e,g,, Table 1, above) that do not substantially alter 
15 the activity or function of the protein or peptide. When equivalent refers to a 
property, the property does not need to be present to the same extent [e.g., 
two peptides can exhibit different rates of the same type of enzymatic 
activity], but the activities are preferably substantially the same. 
Complementary, when referring to two nucleotide sequences, means that 
20 the two sequences of nucleotides are capable of hybridizing, preferably with 
less than 25%, more preferably with less than 15%, even more preferably 
with less than 5%, most preferably with no mismatches between opposed 
nucleotides. Preferably the two molecules will hybridize under conditions of 
high stringency. 

25 As used herein, an agent that modulates the activity of a protein or 

expression of a gene or nucleic acid either decreases or increases or 
otherwise alters the activity of the protein or, in some manner up- or down- 
regulates or otherwise alters expression of the nucleic acid in a cell. 

As used herein, inhibitor of an the activity of an MTSP encompasses 

30 any substances that prohibit or decrease production, post-translational 

modification{s), maturation, or membrane localization of the MTSP or any 
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substances that interfere with or decrease the proteolytic efficacy of thereof, 
particular of a single chain fornn in vitro. 

As used herein, a method for treating or preventing neoplastic disease 
means that any of the symptoms, such as the tumor, metastasis thereof, the 
5 vascularization of the tumors or other parameters by which the disease is 
characterized are reduced, ameliorated, prevented, placed in a state of 
remission, or maintained in a state of remission. It also means that the 
hallmarks of neoplastic disease and metastasis may be eliminated, reduced or 
prevented by the treatment. Non-limiting examples of the hallmarks include 
10 uncontrolled degradation of the basement membrane and proximal 

extracellular matrix, migration, division, and organization of the endothelial 
cells into new functioning capillaries, and the persistence of such functioning 
capillaries. 

As used herein, operatively linked or operationally associated refers to 

15 the functional relationship of DNA with regulatory and effector sequences of 
nucleotides, such as promoters, enhancers, transcriptional and translational 
stop sites, and other signal sequences. For example, operative linkage of 
DNA to a promoter refers to the physical and functional relationship between 
the DNA and the promoter such that the transcription of such DNA is 

20 initiated from the promoter by an RNA polymerase that specifically 
recognizes, binds to and transcribes the DNA. In order to optimize 
expression and/or in vitro transcription, it may be necessary to remove, add 
or alter 5' untranslated portions of the clones to eliminate extra, potential 
inappropriate alternative translation initiation {i.e., start) codons or other 

25 sequences that may interfere with or reduce expression, either at the level of 
transcription or translation. Alternatively, consensus ribosome binding sites 
(see, e^, Kozak J. Biol. Chem. 266:19867-19870 (1991)) can be inserted 
immediately 5' of the start codon and may enhance expression. The 
desirability of (or need for) such modification may be empirically determined. 

30 As used herein, pharmaceutically acceptable salts, esters or other 

derivatives of the conjugates include any salts, esters or derivatives that may 
be readily prepared by those of skill in this art using known methods for such 
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derivatization and that produce compounds that nnay be administered to 
animals or humans without substantial toxic effects and that either are 
pharmaceutically active or are prodrugs. 

As used herein, a prodrug is a compound that, upon in vivo 
5 administration, is metabolized or otherwise converted to the biologically, 
pharmaceutically or therapeutically active form of the compound. To 
produce a prodrug, the pharmaceutically active compound is modified such 
that the active compound will be regenerated by metabolic processes. The 
prodrug may be designed to alter the metabolic stability or the transport 

10 characteristics of a drug, to mask side effects or toxicity, to improve the 
flavor of a drug or to alter other characteristics or properties of a drug. By 
virtue of knowledge of pharmacodynamic processes and drug metabolism in 
vivo , those of skill in this art, once a pharmaceutically active compound is 
known, can design prodrugs of the compound (see, e^, Nogrady (1985) 

15 Medicinal Chemistry A Biochemical Approach , Oxford University Press, New 

York, pages 388-392). 

As used herein, a drug identified by the screening methods provided 
herein refers to any compound that is a candidate for use as a therapeutic or 
as lead compound for designed a therapeutic. Such compounds can be small 
20 molecules, including small organic molecules, peptides, peptide mimetics, 
antisense molecules, antibodies, fragments of antibodies, recombinant 
antibodies and other such compound which can serve as drug candidate or 

lead compound. 

As used herein, production by recombinant means by using 

25 recombinant DNA methods means the use of the well known methods of 
molecular biology for expressing proteins encoded by cloned DNA. 

As used herein, a promoter region or promoter element refers to a 
segment of DNA or RNA that controls transcription of the DNA or RNA to 
which it is operatively linked. The promoter region includes specific 

30 sequences that are sufficient for RNA polymerase recognition, binding and 
transcription initiation. This portion of the promoter region is referred to as 
the promoter. In addition, the promoter region includes sequences that 
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modulate this recognition, binding and transcription initiation activity of RNA 
polymerase. These sequences may be c/'s acting or may be responsive to 
trans acting factors. Promoters, depending upon the nature of the 
regulation, may be constitutive or regulated. Exemplary promoters 
5 contemplated for use in prokaryotes include the bacteriophage T7 and T3 
promoters. 

As used herein, a receptor refers to a molecule that has an affinity for 
a given ligand. Receptors may be naturally-occurring or synthetic molecules. 
Receptors may also be referred to in the art as anti-ligands. As used herein, 

10 the receptor and anti-ligand are interchangeable. Receptors can be used in 
their unaltered state or as aggregates with other species. Receptors may be 
attached, covalently or noncovalently, or in physical contact with, to a 
binding member, either directly or indirectly via a specific binding substance 
or linker. Examples of receptors, include, but are not limited to: antibodies, 

15 cell membrane receptors surface receptors and internalizing receptors, 
monoclonal antibodies and antisera reactive with specific antigenic 
determinants [such as on viruses, cells, or other materials], drugs, 
polynucleotides, nucleic acids, peptides, cofactors, lectins, sugars, 
polysaccharides, cells, cellular membranes, and organelles. 

20 Examples of receptors and applications using such receptors, include 

but are not restricted to: 

a) enzymes: specific transport proteins or enzymes essential to 
survival of microorganisms, which could serve as targets for antibiotic 
[ligand] selection; 

25 b) antibodies: identification of a ligand-binding site on the antibody 

molecule that combines with the epitope of an antigen of interest may be 
investigated; determination of a sequence that mimics an antigenic epitope 
may lead to the development of vaccines of which the immunogen is based 
on one or more of such sequences or lead to the development of related 

30 diagnostic agents or compounds useful in therapeutic treatments such as for 
auto-immune diseases 
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c) nucleic acids: identification of ligand, such as protein or RNA, 
binding sites; 

d) catalytic polypeptides: polymers, preferably polypeptides, that are 
capable of promoting a chemical reaction involving the conversion of one or 

5 more reactants to one or more products; such polypeptides generally include 
a binding site specific for at least one reactant or reaction intermediate and 
an active functionality proximate to the binding site, in which the 
functionality is capable of chemically modifying the bound reactant [see, 
e.g. , U.S. Patent No. 5,215,899]; 
10 e) hormone receptors: determination of the ligands that bind with 

high affinity to a receptor is useful in the development of hormone 
replacement therapies; for example, identification of ligands that bind to such 
receptors may lead to the development of drugs to control blood pressure; 
and 

15 f) opiate receptors: determination of ligands that bind to the opiate 

receptors in the brain is useful in the development of less-addictive 
replacements for morphine and related drugs. 

As used herein, sample refers to anything which may contain an 
analyte for which an analyte assay is desired. The sample may be a 

20 biological sample, such as a biological fluid or a biological tissue. Examples 
of biological fluids include urine, blood, plasma, serum, saliva, semen, stool, 
sputum, cerebral spinal fluid, tears, mucus, amniotic fluid or the like. 
Biological tissues are aggregate of cells, usually of a particular kind together 
with their intercellular substance that form one of the structural materials of 

25 a human, animal, plant, bacterial, fungal or viral structure, including 

connective, epithelium, muscle and nerve tissues. Examples of biological 
tissues also include organs, tumors, lymph nodes, arteries and individual 
celKs). 

As used herein: stringency of hybridization in determining percentage 
30 mismatch is as follows: 

1) high stringency: 0.1 x SSPE, 0.1% SDS, 65^C 

2) medium stringency: 0.2 x SSPE, 0.1% SDS, 50°C 
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3) low Stringency: 1 .0 x SSPE, 0.1 % SDS, 50^C 
Those of skill in this art know that the washing step selects for stable 
hybrids and also know the ingredients of SSPE (see, e.g. , Sambrook, E.F. 
Fritsch, T. Maniatis, in: Molecular Cloning, A Laboratory Manual, Cold 
5 Spring Harbor Laboratory Press (1989), vol. 3, p. B.I 3, see, also, numerous 
catalogs that describe commonly used laboratory solutions). SSPE is pH 7.4 
phophate- buffered 0.18 NaCI. Further, those of skill in the art recognize 
that the stability of hybrids is determined by T^, which is a function of the 
sodium ion concentration and temperature (T^ = 81.5° C-1 6.6(logio[Na^]) + 
10 0.41 {%G + 0-600/1)), so that the only parameters in the wash conditions 

critical to hybrid stability are sodium ion concentration in the SSPE (or SSC) 

and temperature. 

It is understood that equivalent stringencies may be achieved using 
alternative buffers, salts and temperatures. By way of example and not 

15 limitation, procedures using conditions of low stringency are as follows (see 
also Shilo and Weinberg, Proc, Natl. Acad, ScL USA. 78:6789-6792 (1981)): 
Filters containing DNA are pretreated for 6 hours at 40°C in a solution 
containing 35% formamide, 5X SSC, 50 mM Tris-HCI (pH 7.5), 5 mM EDTA, 
0.1% PVP, 0.1% Ficoll, 1% BSA, and 500 //g/ml denatured salmon sperm 

20 DNA (10X SSC is 1.5 M sodium chloride, and 0.15 M sodium citrate, 

adjusted to a pH of 7). 

Hybridizations are carried out in the same solution with the following 
modifications: 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100/ig/ml salmon 
sperm DNA, 10% (wt/vol) dextran sulfate, and 5-20 X 10® cpm ^^P-labeled 

25 probe is used. Filters are incubated in hybridization mixture for 18-20 hours 
at 40°C, and then washed for 1 .5 hours at 55*^C in a solution containing 2X 
SSC, 25 mM Tris-HCI (pH 7.4), 5 mM EDTA, and 0.1% SDS. The wash 
solution is replaced with fresh solution and incubated an additional 1.5 hours 
at 60^C. Filters are blotted dry and exposed for autoradiography. If 

30 necessary, filters are washed for a third time at 65-68°C and reexposed to 
film. Other conditions of low stringency which may be used are well known 
in the art (e,g., as employed for cross-species hybridizations). 



! 
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By way of example and not way of limitation, procedures using 
conditions of moderate stringency is provided. For example, but not limited 
to, procedures using such conditions of moderate stringency are as follows: 
Filters containing DNA are pretreated for 6 hours at 55°C in a solution 
5 containing 6X SSC, 5X Denhart's solution, 0.5% SDS and 100;ug/ml 

denatured salmon sperm DNA. Hybridizations are carried out in the same 
solution and 5-20 X 10^ cpm ^^P-labeled probe is used. Filters are incubated 
in hybridization mixture for 18-20 hours at 55°C, and then washed twice for 
30 minutes at 60°C in a solution containing IX SSC and 0.1% SDS. Filters 
10 are blotted dry and exposed for autoradiography. Other conditions of 

moderate stringency which may be used are well-known in the art. Washing 
of filters is done at 37^C for 1 hour in a solution containing 2X SSC, 0.1 % 
SDS. 

By way of example and not way of limitation, procedures using 
15 conditions of high stringency are as follows: Prehybridization of filters 
containing DNA is carried out for 8 hours to overnight at 65°C in buffer 
composed of 6X SSC, 50 mM Tris-HCI (pH 7.5), 1 mM EDTA, 0.02% PVP, 
0.02% Ficoll, 0.02% BSA, and 500 /yg/ml denatured salmon sperm DNA. 
Filters are hybridized for 48 hours at 65 °C in prehybridization mixture 
20 containing 100 jjgfm\ denatured salmon sperm DNA and 5-20 X 10® cpm of 
^2p-labeled probe. Washing of filters is done at 37°C for 1 hour in a solution 
containing 2X SSC, 0.01% PVP, 0.01% Ficoll, and 0.01% BSA. This is 
followed by a wash in 0.1X SSC at 50°C for 45 minutes before 
autoradiography. Other conditions of high stringency which may be used are 
25 well known in the art. 

The term substantially identical or homologous or similar varies with 
the context as understood by those skilled in the relevant art and generally 
means at least 70%, preferably means at least 80%, more preferably at least 
90%, and most preferably at least 95% identity. 
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As used herein, substantially identical to a product means sufficiently 
similar so that the property of interest is sufficiently unchanged so that the 
substantially identical product can be used in place of the product. 

As used herein, substantially pure means sufficiently homogeneous to 
5 appear free of readily detectable impurities as determined by standard 
methods of analysis, such as thin layer chromatography (TLC), gel 
electrophoresis and high performance liquid chromatography (HPLC), used by 
those of skill in the art to assess such purity, or sufficiently pure such that 
further purification would not detectably alter the physical and chemical 
10 properties, such as enzymatic and biological activities, of the substance. 
Methods for purification of the compounds to produce substantially 
chemically pure compounds are known to those of skill in the art. A 
substantially chemically pure compound may, however, be a mixture of 
stereoisomers or isomers. In such instances, further purification might 
15 increase the specific activity of the compound. 

As used herein, target cell refers to a cell that expresses an MTSP in 

vivo. 

As used herein, test substance refers to a chemically defined 
compound (e.g., organic molecules, inorganic molecules, organic/inorganic 

20 molecules, proteins, peptides, nucleic acids, oligonucleotides, lipids, 

polysaccharides, saccharides, or hybrids among these molecules such as 
glycoproteins, etc.) or mixtures of compounds (e.g., a library of test 
compounds, natural extracts or culture supernatants, etc.) whose effect on 
an MTSP, particularly a single chain form that includes the protease domain 

25 or a sufficient portion thereof for activity, as determined by in vitro method, 
such as the assays provided herein. 

As used herein, the terms a therapeutic agent, therapeutic regimen, 
radioprotectant, chemotherapeutic mean conventional drugs and drug 
therapies, including vaccines, which are known to those skilled in the art. 

30 Radiotherapeutic agents are well known in the art. 

As used herein, treatment means any manner in which the symptoms 
of a conditions, disorder or disease are ameliorated or otherwise beneficially 
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altered. Treatment also encompasses any pharmaceutical use of the 
compositions herein. 

As used herein, vector (or plasmid) refers to discrete elements that are 
used to introduce heterologous DNA into cells for either expression or 
5 replication thereof. The vectors typically remain episomal, but may be 

designed to effect integration of a gene or portion thereof into a chromosome 
of the genome. Also contemplated are vectors that are artificial 
chromosomes, such as yeast artificial chromosomes and mammalian artificial 
chromosomes. Selection and use of such vehicles are well known to those of 

10 skill in the art. An expression vector includes vectors capable of expressing 
DNA that is operatively linked with regulatory sequences, such as promoter 
regions, that are capable of effecting expression of such DNA fragments. 
Thus, an expression vector refers to a recombinant DNA or RNA construct, 
such as a plasmid, a phage, recombinant virus or other vector that, upon 

15 introduction into an appropriate host cell, results in expression of the cloned 
DNA. Appropriate expression vectors are well known to those of skill in the 
art and include those that are replicable in eukaryotic cells and/or prokaryotic 
cells and those that remain episomal or those which integrate into the host 
cell genome. 

20 As used herein, protein binding sequence refers to a protein or peptide 

sequence that is capable of specific binding to other protein or peptide 
sequences generally, to a set of protein or peptide sequences or to a 
particular protein or peptide sequence. 

As used herein, epitope tag refers to a short stretch of amino acid 

25 residues corresponding to an epitope to facilitate subsequent biochemical 

and immunological analysis of the epitope tagged protein or peptide. Epitope 
tagging is achieved by appending the sequence of the epitope tag to the 
protein-encoding sequence in an appropriate expression vector. Epitope 
tagged proteins can be affinity purified using highly specific antibodies raised 

30 against the tags. 
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As used herein, metal binding sequence refers to a protein or peptide 
sequence that is capable of specific binding to metal ions generally, to a set 
of metal ions or to a particular metal ion. 

As used herein, a composition refers to a any mixture. It may be a 
5 solution, a suspension, liquid, powder, a paste, aqueous, non-aqueous or any 

combination thereof. 

As used herein, a combination refers to any association between two 

or among more items. 

As used herein, fluid refers to any composition that can flow. Fluids 
10 thus encompass compositions that are in the form of semi-solids, pastes, 
solutions, aqueous mixtures, gels, lotions, creams and other such 
compositions. 

As used herein, a cellular extract refers to a preparation or fraction 
which is made from a lysed or disrupted cell. 

15 'As used herein, an agent is said to be randomly selected when the agent is 
chosen randomly without considering the specific sequences involved in the 
association of a protein alone or with its associated substrates, binding 
partners, etc. An example of randomly selected agents is the use a chemical 
library or a peptide combinatorial library, or a growth broth of an organism. 

20 As used herein, an agent is the to be rationally selected or designed 

when the agent is chosen on a non-random basis which takes into account 
the sequence of the target site and/or its conformation in connection with 
the agent's action. As described in the Examples, there are proposed binding 
sites for serine protease and (catalytic) sites in the protein having SEQ ID 

25 N0:3 or SEQ ID N0:4. Agents can be rationally selected or rationally 

designed by utilizing the peptide sequences that make up these sites. For 
example, a rationally selected peptide agent can be a peptide whose amino 
acid sequence is identical to the ATP or calmodulin binding sites or domains. 
For clarity of disclosure, and not by way of limitation, the detailed 

30 description is divided into the subsections that follow. 
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B. MTSP PROTEINS, MUTEINS, DERIVATIVES AND ANALOGS THEREOF 
MTSPs 

The MTSPs are a family of transmembrane serine proteases that are 
found in mammals and also other species that share a number of common 
5 structural features including: a proteolytic extracellular C-terminal domain; a 
transmembrane domain, with a hydrophobic domain near the N-terminus; a 
short cytoplasmic domain; and a variable length stem region containing 
modular domains. The proteolytic domains share sequence homologoy 
including conserved his, asp, and ser residues necessary for catalytic activity 

10 that are present in conserved motifs. The MTSPs are synthesized as 

zymogens, and activated to double chain forms by cleavage. It is shown 
herein that the single chain proteolytic domain can function in vitro and, 
hence is useful in in vitro assays for identifying agents that modulate the 
activity of members of this family. Also provided are family members 

15 designated MTSP3, MTSP4 and an MTSP6 variant. 

The MTSP family is a target for therapeutic intervention and also 
some, may serve as diagnostic markers for tumor development, growth 
and/or progression. As discussed, the members of this family are involved in 
proteolytic processes that are implicated in tumor development, growth 

20 and/or progression. This implication is based upon their functions as 

proteolytic enzymes in processes related to ECM degradative pathways. In 
addition, their levels of expression or level of activation or their apparent 
activity resulting from substrate levels or alterations in substrates and levels 
thereof differs in tumor cells and non-tumor cells in the same tissue. Hence, 

25 protocols and treatments that alter their activity, such as their proteolytic 

acitivities and roles in signal transduction, and/or their expression, such as by 
contacting them with a compound that modulates their activity and/or 
expression, could impact tumor development, growth and/or progression. 
Also, in some instances, the level of activation and/or expression may be 

30 altered in tumors, such as lung carcinoma, colon adenocarcinoma and 
ovarian carcinoma. 
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The MTSP may serve as a diagnostic marker for tumors. It is shown 
herein, that MTSPS and MTSP4 and the MTSP6 variant provided herein are 
expressed and/or activated in certain tumors; hence their activation or 
expression may serve as a diagnostic marker for tumor developement, 
5 growth and/or progression. In other instances the MTSP protein can exhibit 
altered activity by virtue of a change in activity or expression of a co-factor 
therefor or a substrate therefor. In addition, in some instances, these 
MTSPS and/or variants thereof may be shed from cell surfaces. Detection of 
the shed MTSPS, particularly the extracellular domains, in body fluids, such 

10 as serum, blood, saliva, cerebral spinal fluid, synovial fluid and interstitial 
fluids, urine, sweat and other such fluids and secretions, may serve as a 
diagnostic tumor marker. In particular, detection of higher levels of such 
shed polypeptides in a subject compared to a subject known not to have any 
neoplastic disease or compared to earlier samples from the same subject, can 

15 be indicative of neoplastic disease in the subject. 

Provided herein are isolated substantially pure single polypeptides that 
contain the protease domain of an MTSP as a single chain. The MTSPs 
contemplated herein are not expressed on endothelial cells, and, preferably, 
are expressed on tumor cells, typically at a level that differs from the level in 

20 which they are expressed in the non-tumor cell of the same type. Hence, for 
example, if the MTSP is expressed in an ovarian tumor cell, to be of interest 
herein with respect to ovarian cancer, it is expressed at the same level in 
non-tumor ovarian cells. MTSP protease domains include the single chain 
protease domains of MTSP1, MTSPS, MTSP4, MTSP6 and other such 

25 proteases, including, but are not limited to, corin, enterpeptidase, human 
airway trypsin-like protease (HAT), MTSP1, TMPRS2, and TMPRSS4. 

Provided are the protease domains or proteins that include a portion of 
an MTSP that is the protease domain of any MTSP, particularly an MTSP1, 
MTSP3, MTSP4 and MTSP6. The protein can also include other non-MTSP 

30 sequences of amino acids, but will include the protease domain or a 

sufficient portion thereof to exhibit catalytic activity in any in vitro assay that 
assess such protease activity, such as any provided herein. 
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Also provided herein are nucleic acid molecules that encode MTSP 
proteins and the encoded proteins. In particular, nucleic acid molecules 
encoding MTSP-3 and MTSP-4 from animals, including splice variants thereof 
are provided. The encoded proteins are also provided. Also provided are 
5 functional domains thereof. 

In specific aspects, the MTSP protease domains, portions thereof, 
and muteins thereof are from or based on animal MTSPS, including, but are 
not limited to, rodent, such as mouse and rat; fowl, such as chicken; 
ruminants, such as goats, cows, deer, sheep; ovine, such as pigs; and 
10 humans. 

In particular, MTSP derivatives can be made by altering their 
sequences by substitutions, additions or deletions that provide for 
functionally equivalent molecules. Due to the degeneracy of nucleotide 
coding sequences, other nucleic sequences which encode substantially the 

15 same amino acid sequence as a MTSP gene can be used. These include but 
are not limited to nucleotide sequences comprising all or portions of MTSP 
genes that are altered by the substitution of different codons that encode the 
amino acid residue within the sequence, thus producing a silent change. 
Likewise, the MTSP derivatives include, but are not limited to, those 

20 containing, as a primary amino acid sequence, all or part of the amino acid 
sequence of MTSP, including altered sequences in which functionally 
equivalent amino acid residues are substituted for residues within the 
sequence resulting in a silent change. For example, one or more amino acid 
residues within the sequence can be substituted by another amino acid of a 

25 similar polarity which acts as a functional equivalent, resulting in a silent 
alteration. Substitutes for an amino acid within the sequence may be 
selected from other members of the class to which the amino acid belongs. 
For example, the nonpolar (hydrophobic) amino acids include alanine, leucine, 
isoleucine, valine, proline, phenylalanine, tryptophan and methionine. The 

30 polar neutral amino acids include glycine, serine, threonine, cysteine, 

tyrosine, asparagine, and glutamine. The positively charged (basic) amino 
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acids include arginine, lysine and inistidine. The negatively charged (acidic) 
amino acids include aspartic acid and glutamic acid (see, e,g., Table 1). 

In a preferred embodiment, the substantially purified MTSP protease is 
encoded by a nucleic acid that hybridizes to the a nucleic acid molecule 
5 containing the protease domain encoded by the nucleotide sequence set 
forth in any of SEQ. ID Nos. 1 , 3, 5, 7, 9 or 1 1 under at least moderate, 
generally high, stringency conditions, such that the protease domain 
encoding nucleic acid thereof hybridizes along its full length. In preferred 
embodiments the substantially purified MTSP protease is a single chain 

10 polypeptide that includes substantially the sequence of amino acids set forth 
in any SEQ ID Nos. 2, 4, 6, 8, 10 and 12 that encodes the protease domain. 
Specific sequences for the following human MTSPs and domains thereof are 
provided as follows: SEQ ID No. 3 MTSP3 nucleic acid sequence; SEQ ID 
No. 4 MTSP3 amino acid sequence; SEQ ID No. 5 MTSP4 nucleic acid 

15 encoding the protease domain; SEQ ID No. 6 MTSP4 amino acid sequence of 
the protease domain; SEQ ID No, 7 MTSP4-L nucleic acid sequence; SEQ ID 
No. 8 MTSP4-L amino acid sequence; SEQ ID No. 9 l\/ITSP4-S nucleic acid 
sequence; SEQ ID No. 10 MTSP4-S amino acid sequence; SEQ ID No, 1 1 
MTSP6 nucleic acid sequence; SEQ ID No. 12 MTSP6 amino acid sequence. 

20 SEQ ID No. 1 sets forth the nucleic acid sequence of the long form of 

MTSP1; SEQ ID No. 2 the encoded amino acid sequence; SEQ ID No. 49 
sets forth the sequence of a protease domain of an MTSP1, and SEQ ID No. 
50 the sequence of the encoded single chain protease domain thereof. 
Figures 1-3 depict the structural organization of the MTSP3, MTSP4 and 

25 MTSP6, respectively. 

In particular, exemplary protease domains include at least a sufficient 
portion of sequences of amino acids set forth as amino acids 615-855 in 
SEQ ID No. 2 (encoded by nucleotides 1865-2587 in SEQ ID No. 1; see also 
SEQ ID Nos. 49 and 50) from MTSP1 (matriptase), amino acids 205-437 of 

30 SEQ ID NO. 4 from MTSP3, SEQ ID No. 6, which sets forth the protease 

domain of MTSP4, and amino acids 217-443 of SEQ ID No. 1 1 from MTSP6. 
Also contemplated are nucleic acid molecules that encode a single 
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' chain MTSP protease that have proteolytic activity in an in vitro proteolysis 
assay and that have at least 60%. 70%, 80%, 85%, 90% or 95% sequence 
identity with the full length of a protease domain of an MTSP protein, or that 
hybridize along their full length to a nucleic acids that encode a protease 
5 domain, particularly under conditions of moderate, generally high, stringency. 
As above, the encoded polypeptides contain the protease as a single chain. 

The isolated nucleic acids may include of at least 8 nucleotides of an 
MTSP sequence. In other embodiments, the nucleic acids may contain least 
25 (continuous) nucleotides, 50 nucleotides, 1 00 nucleotides, 150 

10 nucleotides, or 200 nucleotides of a MTSP sequence, or a full-length MTSP 
coding sequence. In another embodiment, the nucleic acids are smaller than 
35, 200 or 500 nucleotides in length. Nucleic acids can be single or double 
stranded. Nucleic acids that hybridizes to or complementary to the foregoing 
sequences, in particular the inverse complement to nucleic acids that 

15 hybridizes to the foregoing sequences (i.e., the inverse complement of a 
nucleic acid strand has the complementary sequence running in reverse 
orientation to the strand so that the inverse complement would hybridize 
without mismatches to the nucleic acid strand; thus, for example, where the 
coding strand is that hybridizes to a nucleic acid with no mismatches 

20 between the coding strand and the that hybridizes strand, then the inverse 
complement of the that hybridizes strand is identical to the coding strand) 
are also provided. In specific aspects, nucleic acids are provided that include 
a sequence complementary to (specifically are the inverse complement of) at 
least 10, 25, 50, 100, or 200 nucleotides or the entire coding region of an 

25 MTSP encoding nucleic acid, particularly the protease domain thereof. For 
MTSP3 and MTSP4 the full-length protein or domain or active fragment 
thereof. 

For each of the nucleic acid molecules, the nucleic acid can be DNA or 
RNA or PNA or other nucleic acid analogs or may include non-natural 
30 nucleotide bases. 
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Also provided are isolated nucleic acid molecules that include a 
sequence of nucleotides complementary to the nucleotide sequence encoding 
an MTSP. 

Probes and primers derived from the nucleic acid molecules are 
5 provided. Such probes and primers contain at least 8, 14, 16, 30, 100 or 
more contiguous nucleotides with identity to contiguous nucleotides of an 
MTSP, including, but are not limited to, MTSP1, MTSP3, MTSP4 and 
MTSP6. The probes and primers are optionally labelled with a detectable 
label, such as a radiolabel or a fluorescent tag, or can be mass differentiated 

10 for detection by mass spectrometry or other means. 

Plasmids and vectors containing the nucleic acid molecules are also 
provided. Cells containing the vectors, including cells that express the 
encoded proteins are provided. The cell can be a bacterial cell, a yeast cell, 
a fungal cell, a plant cell, an insect cell or an animal cell. Methods for 

15 producing an MTSP or single chain form of the protease domain thereof by, 
for example, growing the cell under conditions whereby the encoded MTSP 
is expressed by the cell, and recovering the expressed protein, are provided 
herein. As noted, for MTSP3 and MTSP4, the full-length zymogens and 
activated proteins and activated (two strand) protease and single chain 

20 protease domains are provided. 

Except for the MTSP proteins (MTSP3 and MTSP4) heretofore 
unidentified and provided herein, the isolated polypeptides contain the MTSP 
protease domain or a catalytically active portion thereof and, generally, do 
not contain additional MTSP. Hence isolated, substantially pure proteases 

25 that e the protease domains or catalytically active portion thereof in single 
chain form of MTSPs are provided. The protease domains may be included 
in a longer protein, but such longer protein is not the MTSP zymogen. 

Thus, MTSP3 and MTSP4 proteins are provided. For these proteins, 
the domains, fragments, derivatives or analogs that are functionally active, 

30 i.e., capable of exhibiting one or more functional activities associated with 
the MTSP protein, e.g., serine protease activity, immunogenicity and 
antigenicity, are provided. As discussed above, the protease domains 
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thereof are also provided. For MTSP3 and MTSP4, the zymogens and 
activated forms, and also, the single chain and double chain, activated 
protease domains are provided. 

Also provided are nucleic acid molecules that hybridize to the above- 
noted sequences of nucleotides encoding MTSP3 and MTSP4 (SEQ ID Nos. 
3, 5, 7 and 9) at least at low stringency, more preferably at moderate 
stringency, and most preferably at high stringency, and that encode the 
protease domain and/or the full length protein or other domains of an MTSP 
family member, such as MTSP3, MTSP4, MTSP6 or a splice variant or allelic 
variant thereof, or MTSP6 or a splice variant or allelic variant thereof. 
Preferably the molecules hybridize under such conditions along their full 
length for at least one domain and encode at least one domain, such as the 
protease or extracellular domain, of the polypeptide. In particular, such 
nucleic acid molecules include any isolated nucleic fragment that encodes at 
least one domain of a membrane serine protease, that (1) contains a 
sequence of nucleotides that encodes the protease or a domain thereof, and 
(2) is selected from among: 

(a) a sequence of nucleotides that encodes the protease or a 
domain thereof includes a sequence of nucleotides set forth 
above; 

(b) a sequence of nucleotides that encodes such portion or the full 
length protease and hybridizes under conditions of high 
stringency, preferably to nucleic acid that is complementary to 
a mRNA transcript present in a mammalian cell that encodes 
such protein or fragment thereof; 

(c) a sequence of nucleotides that encodes a transmembrane 
protease or domain thereof that includes a sequence of amino 
acids encoded by such portion or the full length open reading 
frame; and 

(d) a sequence of nucleotides that encodes the transmembrane 
protease that includes a sequence of amino acids encoded by a 
sequence of nucleotides that encodes such subunit and 
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hybridizes under conditions of high stringency to DNA that is 
complementary to the mRNA transcript. 
Exemplary MTSPs 
The above discussion provides an overview and some details of the 
5 exemplified MTSPs. The following discussion provides additional details 
(see, also, EXAMPLES). 

MTSP1 (matriptase) 
Matriptase is a trypsin-like serine protease with broad spectrum 
cleavage activity and two potential regulatory modules. It was named 
10 "matriptase" because its ability to degrade the extra-cellular matrix and its 
trypsin-like activity. When isolated from breast cancer cells (or T-47D cell 
conditioned medium), matriptase has been reported to be primarily in an 
uncomplexed form. Matriptase has been isolated from human milk; when 
isolated from human milk, matriptase was reported to be in one of two 
15 complexed forms, 95 kDa (the predominant form) and 1 10 kDa; 

uncomplexed matriptase was not detected. (Liu, etaL, J. BioL Chem. 
274 (26): 1 8237-1 8242 (1999).) It has been proposed that matriptase exists 
as an uncomplexed protease when in its active state. In breast milk, 
matriptase has been reported to exist in complex with a fragment of 
20 hepatocyte growth factor inhibitor-1 (HAI-1), a Kuntz-type serine protease 
inhibitor having activity against trypsin-like serine proteases. 

Ecotin and Ecotin M84R/M85R are macromolecular inhibitors of serine 
proteases of the chymotrypsin fold and inhibit ductal branching, 
morphogenesis and differentiation of the explanted ductal prostate. PC-3 is 
25 a cell line derived from prostate cancer epithelial cells. Ecotin and 

M84R/M85R ecotin were found to decrease tumor size and metastasis in PC- 
3 implanted nude mice. 

Matriptase has been isolated and its encoding nucleic acids cloned 
from T-47D human breast cancer cell-conditioned medium (Lin et aL (1999) 
30 J. BioL Chem. 274:18231-18236). Upon analysis of the cDNA, it was 

determined that the full length protease has 683 amino acids and contains 
three main structural regions: a serine protease domain near the carboxyl- 
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terminal region, four tandenn low-density lipoprotein receptor domains, and 
two tandem complement subcomponents Cir and CIs. 

Studies to identify additional serine proteases made by cancer cells 
were done using PC-3 cells. A serine protease termed ''MT-SPI", reported 
5 to be a transmembrane protease was cloned (Takeuchi et al. (1999) Proc. 
NatL Acad. ScL U.S.A. 56:1 1054-1 1061). It was subsequently found the 
originally identified matriptase sequence is included in the translated 
sequence of the cDNA that encodes MT-SP1 . The matriptase cDNA was 
reported to be a partial MT-SP1 cDNA and to lack 516 of the coding 

10 nucleotides (Takeuchi, eta/., J. Biol. Chem 275:26333-26342 (2000).) 

Since the reported matriptase encoding cDNA sequence encoded a possible 
initiating methionine, it was proposed that alternative splicing could yield a 
protein lacking the N-terminal region of MTSP1 . 

Matriptase and MT-SP1 demonstrate trypsin-like protease activity and 

15 are Type II transmembrane proteins with a common extracellular protease 
domain. Studies of substrate specificity of MT-SP1 reveal that 

protease-activated receptor 2 (PAR2) and single-chain urokinase-type 
plasminogen activator (sc-uPA) are macromoiecular substrates of MT-SP1 . 
PAR2 is functions in inflammation, cytoprotection and/or cell adhesion, while 

20 sc-uPa is functions in tumor cell invasion and metastasis. 

An exemplary nucleotide sequences encoding a human MTSP1 is set 
forth in SEQ ID Nos 1 and 2 (see also SEQ ID Nos. 49 and 50 for the 
protease domain thereof). As previously noted SEQ ID No. 1 sets for an 
MTSP1 -encoding nucleic acid sequence. This sequence is the longer version 

25 and includes the protease domain, which is common to both variants 

Nucleic acids encoding the MTSP that hybridizes to the nucleotide sequence 
set forth in SEQ ID No. 1 can be obtained by any method known in the art, 
e.g, by PGR amplification using synthetic primers that hybridizes to the 3' 
and 5' ends of the sequence and/or by cloning from a cDNA or genomic 

30 library using a PGR amplification product or an oligonucleotide specific for 
the gene sequence (e.g., as described in Section G herein). Homologs (e.g., 
nucleic acids of the above-listed genes of species other than human) or other 
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related sequences (e.g., paralogs) and muteins can be obtained by low, 
moderate or high stringency hybridization with all or a portion of the 
particular sequence provided as a probe using methods well known in the art 
for nucleic acid hybridization and cloning. 
5 Isolated single chain protease domains of MTSP1 proteins from 

animals are provided herein. As shown herein, the single chain protease 
domain is catalytically active and can be used in a variety of drug screening 
assays, particularly in vitro proteolytic assays. Exemplary MTSP protease 
domains are set forth as the amino acids (615-855 of SEQ ID No. 2) encoded 

10 by nucleotides 1865-2587 of SEQ ID No. 1 (see, also, SEQ ID Nos. 49 and 
50). The MTSP1 single chain protease domain is catalytically active 

Muteins of the MTSP1 proteins are provided. In the activated double 
chain molecule, residue 731 forms a disulfide bond with the Cys at residue 
604. In the single chain form, the residue at 731 in the protease domain is 

15 free. Muteins in which Cys residues, particularly the free Cys residue 

(amino acid 731 in SEQ ID No. 2) in the single chain protease domain are 
provided. Other muteins in which conservative amino acids replacements are 
effected and that retain proteolytic activity as a single chain are also 
provided. Such changes may be systematically introduced and tested for 

20 activity in in vitro assays, such as those provided herein. 

MTSP3 

In a specific embodiment, a nucleic acid that encodes a MTSP, 
designated MTSP3 is provided. In particular, the nucleic acid includes an 
open reading frame within the following sequence of nucleotides set forth in 

25 SEQ ID No. 3. In particular the protein is encoded by the open 
reading frame that begins at nucleotide 261 and ends at 1574. 

Also provided are nucleic acid molecules that hybridize under 
conditions of at least low stringency, preferably moderate stringency, 
more preferably high stringency to the following sequence of nucleic acids 

30 (SEQ ID No. 3), particularly to the open reading frame encompassed by 
nucleotides that encode a single protease domain thereof, or any domain 
of MTSP3 
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Also included are substantially purified MTSP3 zymogen, activated 
double chains, single chain protease domains and double chain protease 
domains. These are encoded by a nucleic acid that includes sequence 
encoding a protease domain that exhibits proteolytic activity and that 
5 hybridizes to a nucleic acid molecule having a nucleotide sequence set 
forth in SEQ ID No. 3, typically under moderate, generally under high 
stringency, conditions and most preferably along the full length of the 
protease domain. Splice variants are also contemplated herein. 

In a preferred embodiment, the isolated nucleic acid fragment 

10 hybridizes to the nucleic acid having the nucleotide sequence set forth in 
SEQ ID No: 3 under high stringency conditions, and preferably comprises 
the sequence of nucleotides set forth in any of SEQ ID Nos. 3 or 
comprises a portion thereof that encodes a transmembrane domain and 
may additionally include a LDLR domain, a scavenger-receptor cysteine 

15 rich (SRCR) domain and a serine protease catalytic domain or any other 
identified domain (see FIGURES) or comprises nucleic acid molecule that 
encodes the protein encoded by SEQ ID NO. 4. 

The isolated nucleic acid fragment is DNA, including genomic or 
cDNA, or is RNA, or can include other components, such as protein 

20 nucleic acid. The isolated nucleic acid may include additional 

components, such as heterologous or native promoters, and other 
transcriptional and translational regulatory sequences, these genes may 
be linked to other genes, such as reporter genes or other indicator genes 
or genes that encode indicators. 

25 Also provided is an isolated nucleic acid molecule that includes the 

sequence of molecules that is complementary to the nucleotide sequence 
encoding the MTSP or the portion thereof. 

Also provided are fragments thereof that can be used as probes or 
primers and that contain at least about 10 nucleotides, more preferably 

30 14 nucleotides, more preferably at least about 16 nucleotides, most 
preferably at least about 30 nucleotides. 
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Hence provided herein are polypeptides that are encoded by such 
nucleic acid molecules. Included among those polypeptides are the 
MTSPS protease domain or a polypeptide with conservative amino acid 
changes such that the specificity and protease activity remains 
5 substantially unchange. In particular, a substantially purified mammalian 
MTSP protein is provided that has a transmembrane domain and may 
additionally include a CUB domain, one or more of an LDLR domain{s), a 
scavenger-receptor cysteine rich (SRCR) domain and a serine protease 
catalytic domain is provided, 

10 Also provided is a substantially purified protein comprising a 

sequence of amino acids that has at least 60%, more preferably at least 
about 90%, most preferably at least about 95%, identity to the MTSPS, 
wherein the percentage identity is determined using standard algorithms 
and gap penalties that maximize the percentage identity. The human 

15 MTSPS protein is most preferred, although other mammalian MTSPS 
proteins are contemplated. 

Muteins of MTSPS, particularly those in which Cys residues, such 
as the CysSIO in the single chain protease domain, is replaced with 
another amino acid that does not eliminate the activity, are provided. 



Among the proteins provided herein is MTSP4. MTSP4 is highly 
expressed in the liver, and is expressed in substantially lower levels in 
other tissues (see, EXAMPLES). It is also expressed in non-liver-derived 
tumors (see EXAMPLES), including Burkitt's lymphoma, colorectal 
25 adenocarcinoma (SW480), lung carcinoma (A549), and in leukemic cells, 
indicating a role in one or more of tumor progression, tumor invasion, 
tumor growth and tumor metastases. 



20 



MTSP4 



-60- 




24745-1607 



Also provided are nucleic acid molecules that hybridize under 
conditions of at least low stringency, preferably moderate stringency, 
more preferably high stringency to the sequence of nucleic acids set forth 
in SEQ ID Nos. 5, 7 or 9), particularly to the open reading frame 
5 encompassed by nucleotides that encode a single protease domain 
thereof, or any domain of an I\/1TSP4. 

Also included are substantially purified MTSP4 zymogens, activated 
double chains, single chain protease domains and double chain protease 
domains. These are encoded by a nucleic acid that includes sequence 

10 encoding a protease domain that exhibits proteolytic activity and that 
hybridizes to a nucleic acid molecule having a nucleotide sequence set 
forth in SEQ ID Nos. 5, 7 and 9, typically under moderate, generally 
under high stringency, conditions and most preferably along the full 
length of the protease domain. 

15 In a preferred embodiment, the isolated nucleic acid fragment 

hybridizes to the nucleic acid having the nucleotide sequence set forth in 
SEQ ID No: 5, 7 or 9 under high stringency conditions, and preferably 
comprises the sequence of nucleotides set forth in any of SEQ ID Nos. 5, 
7 or 9 comprises a portion thereof that encodes a transmembrane 

20 domain and may additionally include a LDLR domain, a scavenger-receptor 
cysteine rich (SRCR) domain and a serine protease catalytic domain or 
any other identified domain (see FIGURES) or comprises nucleic acid 
molecule that encodes the protein encoded by SEQ ID NO. 6, 9 or 10.. 
The isolated nucleic acid fragment is DNA, including genomic or 

25 cDNA, or is RNA, or can include other components, such as protein 
nucleic acid. The isolated nucleic acid may include additional 
components, such as heterologous or native promoters, and other 
transcriptional and translational regulatory sequences, these genes may 
be linked to other genes, such as reporter genes or other indicator genes 

30 or genes that encode indicators. 
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Also provided is an isolated nucleic acid molecule that includes the 
sequence of molecules'that is complementary to the nucleotide sequence 
encoding and MTSP4 or the portion thereof. 

Also provided are fragments thereof that can be used as probes or 
5 primers and that contain at least about 10 nucleotides, more preferably 
14 nucleotides, more preferably at least about 16 nucleotides, most 
preferably at least about 30 nucleotides. 

In particular nucleic acid molecules encoding two forms of MTSP4 
are provide. The encoded proteins are multi-domain, type II 
10 membrane-type serine proteases and include a transmembrane domain at 
the N terminus followed by a CUB domain, 3 LDLR domains and a 
trypsin-like serine protease domain at the C terminus. The difference 
between the two forms, which are splice variants, is the absence in 
MTSP4-S of a 432-bp nucleotide sequence between the transmembrane 
15 and the CUB domains (see FIGURE 2; see, also SEQ ID Nos. 5-10). 

Also provided is a nucleic acid that encodes the extracellular 
protease domain of an MTSP4 is provided. Both forms of MTSP4 
exemplified herein include a protease domain in common (see SEQ ID 
Nos. 5 and 6). 

20 In particular, the extracellular protease domain of the MTSP4 

proteins is encoded by the open reading frame that begins at nucleotide 1 
and ends at 708 (TGA) (SEQ ID No. 5. This open reading frame encodes 
a portion of the MSTP4 protein and includes the protease domain. Full 
length MSTP4 proteins (SEQ ID Nos. 7 and 9) include the above domain. 

25 The extracellular protease domain, as a single chain, and also an activated 
double chain, exhibit protease activity. The disulfide bonds that form that 
two chain form of MTSP forms are likely between Cys415 and Cys535 
for MTSP4-S, and between Cys559 and Cys679 for MTSP4-L. 

For use of the single chain protease domain thereof, it is of interest 

30 to replace the free Cys (i.e. Cys535 (Cys679)) in the protease domain 
with another amino acid, such as any amino acid that does not alter the 
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function (such change is likely to be any amino acid). Thus, muteins of 
MTSP4, particularly those in which Cys residues, such as the Cys535 and 
Cys679 in the single chain protease domains of MTSP4-S and MTSP4-L, 
respectively, are provided. 
5 MTSP6 

Nucleic acid and the encoded MTSP6 protein of an exemplary 
MTSPe are also provided. The respective sequences are set forth in SEQ 
ID Nos. 1 1 and 12. The MTSPG DNA and protein sequences were 
analyzed using DNA Strider (version 1 .2). The ORF encoding the MTSP6 

10 variant provided herein is composed of 1,362 bp, which translate into a 
453-amino acid protein. MTSP6 is a multi-domain, type-ll membrane-type 
serine protease containing a transmembrane domain (amino acids 48-68) 
at the N-terminus followed by a LDLRa domain (LDL receptor domain 
class a) (amino acids 72-108), a SR domain (Scavenger receptor Cys-rich 

15 domain)(amino acids 109-205), and a trypsin-like serine protease domain 
(amino acids 216-443) (see FIGURE 3). Muteins of MTSP6, particularly 
those in which Cys residues, such as the Cys324 in the single chain 
protease domain of MTSP6 are provided. 

International PCT application No. WO 00/52044 describes MTSPs 

20 that resemble the MTSP6 provided herein. The polypeptide provided 
therein differs at single amino acid positions, such as 90 in SEQ ID No. 
12 (Ala is replaced with a Thr), and significantly from the instant MTSP6 
in that ten amino acids (amino acid nos. 46-55 in SEQ ID No. 12) are 
replaced with the eleven amino acids: phe glu val phe ser gin ser ser ser 

25 leu gly (SEQ ID No. 59) resulting in a protein that is one 454 amino acids 
long. 

There are a few other amino acid sequence differences and a 
number of nucleic acid sequence differences. Significantly, there are 
substantial differences in the protease domain at amino acids 368-394 
30 (368 ICNHRDVYGGIISPSMLCAGYLTGGVD- — 394; SEQ ID No. 12) are 
replaced at position 369-396 with animo acids: 
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369 DLQPQ— -GRVRWHHLPLHALRGLPDGWRWN 396, where the 
differences from 368-394 (Seq ID No, 12) are indicated. 

In addition, a second C-terminus truncated variant with an altered 
protease domain is identified in the PCT application. The variant is the 
same as the 454 variant through amino acid 261 thereof (corresponding 
to 160 of SEQ ID No. 12 herein), followed by 33 amino acids (see SEQ ID 
No. 60 herein) that differ by virtue of a frame shift. 
C. Tumor specificity and tissue expression profiles 

Each MTSP has a characteristic tissue expression profile; the 
MTSPs in particular, although not exclusively expressed or activated in 
tumors, exhibit characteristic tumor tissue expression or activation 
profiles. In some instances, MTSPs may have different activity in a tumor 
cell from a non-tumor cell by virtue of a change in a substrate or cofactor 
therefor or other factor that would alter the apparent of functional activity 
of the MTSP. Hence each can serve as a diagnostic marker for particular 
tumors, by virtue of a level of activity and/or expression or function in a 
subject (i.e. a mammal, particularly a human) with neoplastic disease, 
compared to a subject or subjects that do not have the neoplastic 
disease. In addition, detection of activity (and/or expression) in a 
particular tissue can be indicative of neoplastic disease. Shed MTSPs in 
body fluids can be indicative of neoplastic disease. Also, by virtue of the 
activity and/or expression profiles of each, they can serve as therapeutic 
targets, such as by administration of modulators of the activity thereof, 
or, as by administration of a prodrug specifically activated by one of the 
MTSPs. 

Tissue expression profiles 
MTSPS 

The MTSP3 transcript was detected in lung carcinoma (LX-1), 
colon adenocarcinoma (CX-1), colon adenocarcinoma (GI-112) and 
ovarian carcinoma (GI-102). No apparent signal was detected in another 
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form of lung carcinoma (GI-117), breast carcinoma (GI-101), pancreatic 
adenocarcinoma (GI-103) and prostatic adenocarcinoma (PCS). 
MTSP1 is expressed in breast cancers. 

MTSP4 

The MTSP4 transcript, a DNA fragment encoding part of the LDL 
receptor domain and the protease domain was used to probe an RNA blot 
composed of 76 different human tissues (catalog number 7775-1; human 
multiple tissue expression (MTE) array; CLONTECH). As in the northern 
analysis of gel blot, a very strong signal was observed in the liver. 
Signals in other tissues were observed in (decreasing signal level): fetal 
liver > heart = kidney = adrenal gland = testis = fetal heart and kidney 
= skeletal muscle = bladder = placenta > brain = spinal cord = colon 
= stomach = spleen = lymph node = bone marrow = trachea = uterus 
= pancreas = salivary gland = mammary gland = lung. I\/ITSP4 is also 
expressed less abundantly in several tumor cell lines including HeLa S3 = 
leukemia K-562 = Burkitt's lymphomas (Raji and Daudi) = colorectal 
adenocarcinoma (SW480) > lung carcinoma (A549) = leukemia MOLT-4 
= leukemia HL-60. PGR of the MTSP4 transcript from cDNA libraries 
made from several human primary tumors xenografted in nude mice 
(human tumor multiple tissue cDNA panel, catalog number K1 522-1, 
CLONTECH) was performed using MTSP4-specific primers. The MTSP4 
transcript was detected in breast carcinoma (GI-101), lung carcinoma 
(LX-1), colon adenocarcinoma (GI-112) and pancreatic adenocarcinoma 
(GI-103). No apparent signal was detected in another form of lung 
carcinoma {GI-117), colon adenocarcinoma (CX-1), ovarian carcinoma 
(GI-102). and prostatic adenocarcinoma (PC3). The MTSP4 transcript 
was also detected in LNCaP and PC-3 prostate cancer cell lines as well as 
in HT-1080 human fibrosarcoma cell line. 
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Gene expression profile of MTSP6 in normal and tumor 
tissues 

To obtain information regarding the gene expression profile of the 
MTSP6 transcript, a 495 bp DNA fragment obtained from PGR reaction 
5 with primers Ch17-NSP-3 and NSP-4AS was used to probe an RNA blot 
composed of 76 different human tissues (catalog number 7775-1; human 
multiple tissue expression (MTE) array; CLONTECH). The strongest signal 
was observed in duodenum. Signal in other tissues were observed in 
(decreased signal level): Stomach > trachea = mammary gland = 

10 thyroid gland = salivary gland = pituitary gland = pancreas > kidney > 
lung > jejunum = ileum = ilocecum = appendix = fetal kidney > fetal 
lung. Very weak signals can also be detected in several other tissues. 

MTSP6 is also expressed in several tumor cell lines including HeLa 
S3 > colorectal adenocarcinoma (SW480) > leukemia MOLT-4 > 

15 leukemia K-562. PGR analysis of the MTSP6 transcript from cDNA 

libraries made from several human primary tumors xenografted in nude 
mice (human tumor multiple tissue cDNA panel, catalog number K1 522-1, 
GLONTEGH) was performed using MTSP6-specific primers (Gh17-NSP-3 
and Ghl 7-NSP2AS). The MTSP6 transcript was strongly detected in lung 

20 carcinoma (LX-I), moderately detected in pancreatic adenocarcinoma 

(GI-103), weakly detected in ovarian carcinoma (Gl-102); and very weakly 
detected in colon adenocarcinoma (GI-1 12 and GX-1), breast carcinoma 
(GI-101), lung carcinoma (GI-1 17) and prostatic adenocarcinoma (PG3). 
The MTSP6 transcript was also detected in breast cancer cell line 

25 MDA-MB-231 , prostate cancer cell line PG-3, but not in HT-1080 human 
fibrosarcoma cell line. MTSP6 is also expressed in mammary gland 
carcinoma cDNA (Glontech). MTSPG is also over expressed in ovarian 
tumor cells. 
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D. Identification and isolation of MTSP protein genes 

The MTSP proteins, or domains thereof, can be obtained by 
methods well known in the art for protein purification and recombinant 
protein expression. Any method known to those of skill in the art for 
5 identification of nucleic acids that encode desired genes may be used. 
Any method available in the art can be used to obtain a full length (/.e,, 
encompassing the entire coding region) cDNA or genomic DNA clone 
encoding an MTSP protein. In particular, the polymerase chain reaction 
(PCR) can be used to amplify a sequence identified as being differentially 

10 expressed in normal and tumor cells or tissues, e.g., nucleic acids 
encoding an MTSP protein (SEQ. NOs: 1-12), in a genomic or cDNA 
library. Oligonucleotide primers that hybridize to sequences at the 3' and 
5' termini of the identified sequences can be used as primers to amplify 
by PCR sequences from a nucleic acid sample (RNA or DNA), preferably a 

15 cDNA library, from an appropriate source (e.g., tumor or cancer tissue). 

PCR can be carried out, e.g., by use of a Perkin-Elmer Cetus 
thermal cycler and Taq polymerase (Gene Amp"). The DNA being 
amplified can include mRNA or cDNA or genomic DNA from any 
eukaryotic species. One can choose to synthesize several different 

20 degenerate primers, for use in the PCR reactions. It is also possible to 
vary the stringency of hybridization conditions used in priming the PCR 
reactions, to amplify nucleic acid homologs (e.g., to obtain MTSP protein 
sequences from species other than humans or to obtain human sequences 
with homology to MTSP protein) by allowing for greater or lesser degrees 

25 of nucleotide sequence similarity between the known nucleotide sequence 
and the nucleic acid homolog being isolated. For cross species 
hybridization, low stringency conditions are preferred. For same species 
hybridization, moderately stringent conditions are preferred. After 
successful amplification of the nucleic acid containing all or a portion of 

30 the identified MTSP protein sequence or of a nucleic acid encoding all or a 
portion of an MTSP protein homolog, that segment may be molecularly 
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cloned and sequenced, and used as a probe to isolate a complete cDNA 
or genomic clone. This, in turn, will permit the determination of the 
gene's complete nucleotide sequence, the analysis of its expression, and 
the production of its protein product for functional analysis. Once the 
nucleotide sequence is determined, an open reading frame encoding the 
MTSP protein gene protein product can be determined by any method 
well known in the art for determining open reading frames, for example, 
using publicly available computer programs for nucleotide sequence 
analysis. Once an open reading frame is defined, it is routine to 
determine the amino acid sequence of the protein encoded by the open 
reading frame. In this way, the nucleotide sequences of the entire MTSP 
protein genes as well as the amino acid sequences of MTSP protein 
proteins and analogs may be identified. 

Any eukaryotic cell potentially can serve as the nucleic acid source 
for the molecular cloning of the MTSP protein gene. The nucleic acids 
can be isolated from vertebrate, mammalian, human, porcine, bovine, 
feline, avian, equine, canine, as well as additional primate sources, 
insects, plants, etc. The DNA may be obtained by standard procedures 
known in the art from cloned DNA (e.g., a DNA "library"), by chemical 
synthesis, by cDNA cloning, or by the cloning of genomic DNA, or 
fragments thereof, purified from the desired cell (see, for example, 
Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual, 2d Ed., 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York; 
Glover, D.M. (ed.), 1985, DNA Cloning: A Practical Approach, MRL Press, 
Ltd., Oxford, U.K. Vol. I, II). Clones derived from genomic DNA may 
contain regulatory and intron DNA regions in addition to coding regions; 
clones derived from cDNA will contain only exon sequences. Whatever 
the source, the gene should be molecularly cloned into a suitable vector 
for propagation of the gene. 

In the molecular cloning of the gene from genomic DNA, DNA 
fragments are generated, some of which will encode the desired gene. 
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The DNA may be cleaved at specific sites using various restriction 
enzymes. Alternatively, one may use DNAse in the presence of 
manganese to fragment the DNA, or the DNA can be physically sheared, 
for example, by sonication. The linear DNA fragments can then be 
separated according to size by standard techniques, including but not 
limited to, agarose and polyacrylamide gel electrophoresis and column 
chromatography. 

Once the DNA fragments are generated, identification of the 
specific DNA fragment containing the desired gene may be accomplished 
in a number of ways. For example, a portion of the MTSP protein (of any 
species) gene {e.g., a PGR amplification product obtained as described 
above or an oligonucleotide having a sequence of a portion of the known 
nucleotide sequence) or its specific RNA, or a fragment thereof be purified 
and labeled, and the generated DNA fragments may be screened by 
nucleic acid hybridization to the labeled probe (Benton and Davis, Science 
196:180 (1977); Grunstein and Hogness, Proc. Natl. Acad. Sci. U.S.A. 
72:3961 (1975)). Those DNA fragments with substantial homology to 
the probe will hybridize. It is also possible to identify the appropriate 
fragment by restriction enzyme digestion(s) and comparison of fragment 
sizes with those expected according to a known restriction map if such is 
available or by DNA sequence analysis and comparison to the known 
nucleotide sequence of MTSP protein. Further selection can be carried 
out on the basis of the properties of the gene. Alternatively, the presence 
of the gene may be detected by assays based on the physical, chemical, 
or immunological properties of its expressed product. For example, cDNA 
clones, or DNA clones which hybrid-select the proper mRNA, can be 
selected which produce a protein that, e.g., has similar or identical 
electrophoretic migration, isolectric focusing behavior, proteolytic 
digestion maps, antigenic properties, serine protease activity. If an anti- 
MTSP protein antibody is available, the protein may be identified by 
binding of labeled antibody to the putatively MTSP protein synthesizing 
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clones, in an ELISA (enzyme-linked immunosorbent assay)-type 
procedure. 

Alternatives to isolating the MTSP protein genomic DNA include, 
but are not limited to, chemically synthesizing the gene sequence from a 
known sequence or making cDNA to the mRNA that encodes the MTSP 
protein. For example, RNA for cDNA cloning of the MTSP protein gene 
can be isolated from cells expressing the protein. The identified and 
isolated nucleic acids can then be inserted into an appropriate cloning 
vector, A large number of vector-host systems known in the art may be 
used. Possible vectors include, but are not limited to, plasmids or 
modified viruses, but the vector system must be compatible with the host 
cell used. Such vectors include, but are not limited to, bacteriophages 
such as lambda derivatives, or plasmids such as pBR322 or pUC plasmid 
derivatives or the Bluescript vector (Stratagene, La Jolla, CA). The 
insertion into a cloning vector can, for example, be accomplished by 
ligating the DNA fragment into a cloning vector which has complementary 
cohesive termini. I the complementary restriction sites used to fragment 
the DNA are not present in the cloning vector, the ends of the DNA 
molecules may be enzymatically modified. Alternatively, any site desired 
may be produced by ligating nucleotide sequences (linkers) onto the DNA 
termini; these ligated linkers may comprise specific chemically 
synthesized oligonucleotides encoding restriction endonuclease 
recognition sequences. In an alternative method, the cleaved vector and 
MTSP protein gene may be modified by homopolymeric tailing. 
Recombinant molecules can be introduced into host cells via 
transformation, transfection, infection, electroporation, etc., so that many 
copies of the gene sequence are generated. 

In an alternative method, the desired gene may be identified and 
isolated after insertion into a suitable cloning vector in a "shot gun" 
approach. Enrichment for the desired gene, for example, by size 
fractionization, can be done before insertion into the cloning vector. 
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In specific embodiments, transformation of host ceils with 
recombinant DNA molecules that incorporate the isolated MTSP protein 
gene, cDNA, or synthesized DNA sequence enables generation of multiple 
copies of the gene. Thus, the gene may be obtained in large quantities by 
5 growing transformants, isolating the recombinant DNA molecules from 
the transformants and, when necessary, retrieving the inserted gene from 
the isolated recombinant DNA. 

E. Vectors, plasmids and cells that contain nucleic acids encoding an 
MTSP protein or protease domain thereof and expression of MTSP 
10 proteins 

Vectors and cells 

For recombinant expression of one or more of the MTSP proteins, 
the nucleic acid containing all or a portion of the nucleotide sequence 
encoding the MTSP protein can be inserted into an appropriate expression 
15 vector, i.e., a vector that contains the necessary elements for the 

transcription and translation of the inserted protein coding sequence. The 
necessary transcriptional and translational signals can also be supplied by 
the native promoter for MTSP genes, and/or their flanking regions. 

Also provided are vectors that contain nucleic acid encoding the 
20 MTSPs. Cells containing the vectors are also provided. The cells include 
eukaryotic and prokaryotic cells, and the vectors are any suitable for use 
therein. 

Prokaryotic and eukaryotic cells, including endothelial cells, 
containing the vectors are provided. Such cells include bacterial cells, 

25 yeast cells, fungal cells, plant cells, insect cells and animal cells. The 

cells are used to produce an MTSP protein or protease domain thereof by 
growing the above-described cells under conditions whereby the encoded 
MTSP protein or protease domain of the MTSP protein is expressed by the 
cell, and recovering the expressed protease domain protein. For purposes 

30 herein, the protease domain is preferably secreted into the medium. 

In one embodiment, the vectors include a sequence of nucleotides 
that encodes a polypeptide that has protease activity and contains all or a 
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portion of only the protease domain, or multiple copies thereof, of an 
MTSP protein are provided. Also provided are vectors that comprise a 
sequence of nucleotides that encodes the protease domain and additional 
portions of an MTSP protein up to and including a full length MTSP 
5 protein, as well as multiple copies thereof, are also provided. The 
vectors may selected for expression of the MTSP protein or protease 
domain thereof in the cell or such that the MTSP protein is expressed as a 
transmembrane protein. Alternatively, the vectors may include signals 
necessary for secretion of encoded proteins. When the protease domain 

10 is expressed the nucleic acid is preferably linked to a secretion signal, 

such as the Saccharomyces cerevisiae a mating factor signal sequence or 
a portion thereof. 

A variety of host-vector systems may be used to express the 
protein coding sequence. These include but are not limited to mammalian 

15 cell systems infected with virus (e.g. vaccinia virus, adenovirus, etc.); 

insect cell systems infected with virus (e.g. baculovirus); microorganisms 
such as yeast containing yeast vectors; or bacteria transformed with 
bacteriophage, DNA, plasmid DNA, or cosmid DNA. The expression 
elements of vectors vary in their strengths and specificities. Depending 

20 on the host-vector system used, any one of a number of suitable 
transcription and translation elements may be used. 

Any methods known to those of skill in the art for the insertion of 
DNA fragments into a vector may be used to construct expression vectors 
containing a chimeric gene containing of appropriate 

25 transcriptional/translational control signals and protein coding sequences. 
These methods may include in vitro recombinant DNA and synthetic 
techniques and in vivo recombinants (genetic recombination). Expression 
of nucleic acid sequences encoding MTSP protein, or domains, 
derivatives, fragments or homologs thereof, may be regulated by a 

30 second nucleic acid sequence so that the genes or fragments thereof are 
expressed in a host transformed with the recombinant DNA molecule(s). 
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For example, expression of the proteins may be controlled by any 
promoter/enhancer known in the art. In a specific embodiment, the 
promoter is not native to the genes for MTSP protein. Promoters which 
may be used include but are not limited to the SV40 early promoter 
5 (Bernoist and Chambon, Nature 290 :304-310 (1981)), the promoter 
contained in the 3' long terminal repeat of Rous sarcoma virus 
(Yamamoto et a!., Ce// 22:787-797 (1980)), the herpes thymidine kinase 
promoter (Wagner et al., Proc. Natl. Acad. Sci. USA 78:1441-1445 
(1981)), the regulatory sequences of the metallothionein gene (Brinster et 

10 al.. Nature 296 :39-42 (1982)); prokaryotic expression vectors such as the 
yff-lactamase promoter (Villa-Kamaroff et al., Proc, Natl. Acad. Sci. USA 
75:3727-3731 1978)) or the tac promoter (DeBoer et a!., Proc. Natl. 
Acad. Sci. USA 80:21-25 (1983)); see also "Useful Proteins from 
Recombinant Bacteria": in Scientific American 242 :79-94 (1980)); plant 

15 expression vectors containing the nopaline synthetase promoter (Herrar- 
Estrella et al.. Nature 303:209-213 (1984)) or the cauliflower mosaic 
virus 35S RNA promoter (Garder et al.. Nucleic Acids Res. 9:2871 
(1981)), and the promoter of the photosynthetic enzyme ribulose 
bisphosphate carboxylase (Herrera-Estrella et al.. Nature 310 :1 15-120 

20 (1984)); promoter elements from yeast and other fungi such as the Gal4 
promoter, the alcohol dehydrogenase promoter, the phosphoglycerol 
kinase promoter, the alkaline phosphatase promoter, and the following 
animal transcriptional control regions that exhibit tissue specificity and 
have been used in transgenic animals: elastase I gene control region 

25 which is active in pancreatic acinar cells (Swift et a!., Cell 38:639-646 
(1984); Ornitz et al.. Cold Spring Harbor Symp. Quant. Biol. 50:399-409 
(1986); MacDonald, Hepatology Z'^2^-b^b (1987)); insulin gene control 
region which is active in pancreatic beta cells (Hanahan et al.. Nature 
315 :11 5-1 22 (1985)), immunoglobulin gene control region which is active 

30 in lymphoid cells (Grosschedl et al.. Cell 38:647-658 (1984); Adams et 
al.. Nature 318 :533-538 (1985); Alexander et al., MoL Cell Biol. 7:1436- 
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1444 (1987)), mouse mammary tumor virus control region which is active 
in testicular, breast, lymphoid and mast cells (Leder et a!.. Cell 45:485- 
495 (1986)), albumin gene control region which is active in liver (Pinckert 
et al., Genes and DeveL 1:268-276 (1987)), alpha-fetoprotein gene 
5 control region which is active in liver (Krumlauf et al., Mol. Cell. Biol. 

5:1639-1648 (1985); Hammer et al.. Science 235 :53-58 1987)), alpha-1 
antitrypsin gene control region which is active in liver (Kelsey et al.. 
Genes and DeveL 1:161-171 (1987)), beta globin gene control region 
which is active in myeloid cells (Mogram et al.. Nature 31 5 :338-340 

10 (1985); Kollias et al., Ce// 46:89-94 (1986)), myelin basic protein gene 
control region which is active in oligodendrocyte cells of the brain 
(Readhead et al., Ce// 48:703-71 2 (1987)), myosin light chain-2 gene 
control region which is active in skeletal muscle (Sani, Nature 31 4 :283- 
286 (1985)), and gonadotrophic releasing hormone gene control region 

15 which is active in gonadotrophs of the hypothalamus (Mason et al.. 
Science 234:1372-1378 (1986)). 

In a specific embodiment, a vector is used that contains a promoter 
operably linked to nucleic acids encoding an MTSP protein, or a domain, 
fragment, derivative or homolog, thereof, one or more origins of 

20 replication, and optionally, one or more selectable markers (e.g., an 
antibiotic resistance gene). Expression vectors containing the coding 
sequences, or portions thereof, of an MTSP protein, is made, for example, 
by subcloning the coding portions into the EcoRI restriction site of each of 
the three pGEX vectors (glutathione S-transferase expression vectors 

25 (Smith and Johnson, Gene 7:31-40 (1988)). This allows for the 

expression of products in the correct reading frame. Preferred vectors 
and systems for expression of the protease domains of the MTSP proteins 
are well known Pichia vectors (available, for example, from Invitrogen, 
San Diego, CA), particularly those designed for secretion of the encoded 

30 proteins. One exemplary vector is described in the EXAMPLES. 
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Plasmids for transformation of coli cells, include, for example, 
the pET expression vectors (see, U.S patent 4,952,496; available from 
NOVAGEN, Madison, Wl; see, also literature published by Novagen 
describing the system). Such plasmids include pET 11a, which contains 
5 the T7lac promoter, T7 terminator, the inducible coli lac operator, and 
the lac repressor gene; pET 12a-c, which contains the T7 promoter, T7 
terminator, and the E^ coli ompT secretion signal; and pET 1 5b and 
pET19b {NOVAGEN, Madison, Wl), which contain a His-Tag™ leader 
sequence for use in purification with a His column and a thrombin 

10 cleavage site that permits cleavage following purification over the column; 
the T7-lac promoter region and the T7 terminator. 

The vectors are introduced into host cells, such as Pichia cells and 
bacterial cells, such as E. coli, and the proteins expressed therein. 
Preferred Pichia strains, include, for example, GS115. Preferred bacterial 

15 hosts contain chromosomal copies of DNA encoding T7 RNA polymerase 
operably linked to an inducible promoter, such as the lacUV promoter 
(see, U.S. Patent No. 4,952,496). Such hosts include, but are not limited 
to, the lysogenic E^ coli strain BL21 (DE3). 



various methods known in the art. For example, once a recombinant cell 
expressing an MTSP protein, or a domain, fragment or derivative thereof, 
is identified, the individual gene product can be isolated and analyzed. 
This is achieved by assays based on the physical and/or functional 

25 properties of the protein, including, but not limited to, radioactive labeling 
of the product followed by analysis by gel electrophoresis, immunoassay, 
cross-linking to marker-labeled product. The MTSP protein proteins may 
be isolated and purified by standard methods known in the art (either 
from natural sources or recombinant host cells expressing the complexes 

30 or proteins), including but not restricted to column chromatography (e.g., 
ion exchange, affinity, gel exclusion, reversed-phase high pressure, fast 



20 



Expression and production of proteins 
The MTSP domains, derivatives and analogs be produced by 
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protein liquid, etc.), differential centrifugation, differential solubility, or by 
any other standard technique used for the purification of proteins. 
Functional properties may be evaluated using any suitable assay known in 
the art. 

5 Alternatively, once an MTSP protein or its domain or derivative is 

identified, the amino acid sequence of the protein can be deduced from 
the nucleotide sequence of the gene which encodes it. As a result, the 
protein or its domain or derivative can be synthesized by standard 
chemical methods known in the art {e.g. see Hunkapiller et al. Nature 

10 310:105-1 1 1 (1984)). 

Manipulations of MTSP protein sequences may be made at the 
protein level. Also contemplated herein are MTSP protein proteins, 
domains thereof, derivatives or analogs or fragments thereof, which are 
differentially modified during or after translation, e.g., by glycosylation, 

15 acetylation, phosphorylation, amidation, derivatization by known 

protecting/blocking groups, proteolytic cleavage, linkage to an antibody 
molecule or other cellular ligand, etc. Any of numerous chemical 
modifications may be carried out by known techniques, including but not 
limited to specific chemical cleavage by cyanogen bromide, trypsin, 

20 chymotrypsin, papain, V8 protease, NaBH4, acetylation, formylation, 

oxidation, reduction, metabolic synthesis in the presence of tunicamycin, 
etc, 

In addition, domains, analogs and derivatives of an MTSP protein 
can be chemically synthesized. For example, a peptide corresponding to 

25 a portion of an MTSP protein, which includes the desired domain or which 
mediates the desired activity in vitro can be synthesized by use of a 
peptide synthesizer. Furthermore, if desired, nonclassical amino acids or 
chemical amino acid analogs can be introduced as a substitution or 
addition into the MTSP protein sequence. Non-classical amino acids 

30 include but are not limited to the D-isomers of the common amino acids, 
a-amino isobutyric acid, 4-aminobutyric acid, Abu, 2-aminobutyric acid. 
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e-Abu, e-Ahx, 6-amino hexanoic acid, Aib, 2-amino isobutyric acid, 
3-amino propionoic acid, ornithine, norleucine, norvaline, hydroxyproline, 
sarcosine, citrulline, cysteic acid, t-butylglycine, t-butylalanine, 
phenylglycine, cyclohexylalanine, B-alanine, fluoro-amino acids, designer 
5 amino acids such as B-methyl amino acids, Ca-methyl amino acids, Na- 
methyl amino acids, and amino acid analogs in general. Furthermore, the 
amino acid can be D (dextrorotary) or L (levorotary). 

In cases where natural products are suspected of being mutant or 
are isolated from new species, the amino acid sequence of the MTSP 

10 protein isolated from the natural source, as well as those expressed in 
vitro, or from synthesized expression vectors in vivo or in vitro, can be 
determined from analysis of the DNA sequence, or alternatively, by direct 
sequencing of the isolated protein. Such analysis may be performed by 
manual sequencing or through use of an automated amino acid 

15 sequenator. 

Modifications 

A variety of modification of the MTSP proteins and domains are 
contemplated herein. An MTSP-encoding nucleic acid molecule be 
modified by any of numerous strategies known in the art (Sambrook et 

20 al., 1990, Molecular Cloning, A Laboratory Manual, 2d ed.. Cold Spring 
Harbor Laboratory, Cold Spring Harbor, New York). The sequences can 
be cleaved at appropriate sites with restriction endonuclease{s), followed 
by further enzymatic modification if desired, isolated, and ligated in vitro. 
In the production of the gene encoding a domain, derivative or analog of 

25 MTSP, care should be taken to ensure that the modified gene retains the 
original translational reading frame, uninterrupted by translational stop 
signals, in the gene region where the desired activity is encoded. 

Additionally, the MTSP-encoding nucleic acid molecules can be 
mutated in vitro or in vivo, to create and/or destroy translation, initiation, 

30 and/or termination sequences, or to create variations in coding regions 
and/or form new restriction endonuclease sites or destroy pre-existing 
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ones, to facilitate further in vitro modification. Also, as described herein 
muteins with primary sequence alterations, such as replacements of Cys 
residues and elimination of glycosylation sites are contemplated. Such 
mutations may be effected by any technique for mutagenesis known in 
5 the art, including, but not limited to, chemical mutagenesis and in vitro 
site-directed mutagenesis (Hutchinson et al., J. Biol. Cliem, 253 :6551- 
6558 (1978)), use of TAB® linkers (Pharmacia). In one embodiment, for 
example, an MTSP protein or domain thereof is modified to include a 
fluorescent label. In other specific embodiments, the MTSP protein is 

10 modified to have a heterofunctional reagent, such heterofunctional 
reagents can be used to crosslink the members of the complex. 

The MTSP proteins may be isolated and purified by standard 
methods known in the art (either from natural sources or recombinant 
host cells expressing the complexes or proteins), including but not 

15 restricted to column chromatography [e.g., ion exchange, affinity, gel 
exclusion, reversed-phase high pressure, fast protein liquid, etc.), 
differential centrifugation, differential solubility, or by any other standard 
technique used for the purification of proteins. Functional properties may 
be evaluated using any suitable assay known in the art. 

20 Alternatively, once a MTSP or its domain or derivative is identified, 

the amino acid sequence of the protein can be deduced from the 
nucleotide sequence of the gene which encodes it. As a result, the 
protein or its domain or derivative can be synthesized by standard 
chemical methods known in the art (e.g., see Hunkapiller et al. Nature, 

25 310:105-111 (1984)). 

Manipulations of MTSP sequences may be made at the protein 
level. MTSP domains, derivatives or analogs or fragments, which are 
differentially modified during or after translation, e.g., by glycosylation, 
acetylation, phosphorylation, amidation, derivatization by known 

30 protecting/blocking groups, proteolytic cleavage, linkage to an antibody 
molecule and other cellular ligand, are contemplated herein. Any of 
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numerous chemical modifications may be carried out by known 
techniques, including but not limited to specific chemical cleavage by 
cyanogen bromide, trypsin, chymotrypsin, papain, V8 protease, NaBH4, 
acetylation, formylation, oxidation, reduction, metabolic synthesis in the 
5 presence of tunicamycin, etc. 

In addition, domains, analogs and derivatives of a MTSP can be 
chemically synthesized. For example, a peptide corresponding to a 
portion of a MTSP, which comprises the desired domain or which 
mediates the desired activity in vitro can be synthesized by use of a 

10 peptide synthesizer. Furthermore, if desired, nonclassical amino acids or 
chemical amino acid analogs can be introduced as a substitution or 
addition into the MTSP sequence. Non-classical amino acids include but 
are not limited to the D-isomers of the common amino acids, a-amino 
isobutyric acid, 4-aminobutyric acid, Abu, 2-aminobutyric acid, 6-Abu, 

15 e-Ahx, 6-amino hexanoic acid, Aib, 2-amino isobutyric acid, 3-amino 
propionoic acid, ornithine, norleucine, norvaline, hydroxyproline, 
sarcosine, citrulline, cysteic acid, t-butylglycine, t-butylalanine, 
phenylglycine, cyclohexylalanine, S-alanine, fluoro-amino acids, designer 
amino acids such as IS-methyl amino acids, Ca-methyl amino acids, Na- 

20 methyl amino acids, and amino acid analogs in general. Furthermore, the 
amino acid can be D (dextrorotary) or L (levorotary). 
F. SCREENING METHODS 

The single chain protease domains, as shown herein, can be used 
in a variety of methods to identify compounds that modulate the activity 

25 thereof. For MTSPs that are exhibit higher activity or expression in tumor 
cells, compounds that inhibit the proteolytic activity are of particular 
interest. For any MTSPs that are active at lower levels in tumor cells, 
compounds or agents that enhance the activity are potentially of interest. 
In all instances the identified compounds will include agents that are 

30 candidate cancer treatments. 
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Several types of assays are exemplified and described herein. It is 
understood that the protease domains may be used in other assays. It is 
shown here, however, that the single chain protease domains exhibit 
catalytic activity. As such they are ideal for in vitro screening assays. 
5 They may also be used in binding assays. 

The MTSPS, MTSP4 and MTSP6 full length zymogens, activated 
enzymes, single and double chain protease domains are contemplated for 
use in any screening assay known to those of skill in the art, including 
those provided herein. Hence the following description, if directed to 

10 proteolytic assays is intended to apply to use of a single chain protease 
domain or a catalytically active portion thereof of any MTSP, including an 
MTSP3, MTSP4 or an MTSP6. Other assays, such as binding assays are 
provided herein, particularly for use with an MTSP3, MTSP4 or MTSP6, 
including any variants, such as splice variants thereof. MTSP3 and 

15 MTSP4 are of most interest in such assays. 

1 . Catalytic Assays for identification of agents that modulate 
the protease activity of an MTSP protein 

Methods for identifying a modulator of the catalytic activity of an 

MTSP, particularly a single chain protease domain or catalytically active 

20 portion thereof, are provided herein. The methods can be practiced by: 
a) contacting the MTSP, particularly a single-chain domain thereof, with a 
substrate of the MTSP in the presence of a test substance, and detecting 
the proteolysis of the substrate, whereby the activity of the MTSP is 
assessed, and comparing the activity to a control. For example, the 

25 control can be the activity of the MTSP assessed by contacting an MTSP, 
particularly a single-chain domain thereof, with a substrate of the MTSP, 
and detecting the proteolysis of the substrate, whereby the activity of the 
MTSP is assessed. The results in the presence and absence of the test 
compounds are compared. A difference in the activity indicates that the 

30 test substance modulates the activity of the MTSP. 
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In one embodiment a plurality of the test substances are screened 
simultaneously in the above screening method. In another embodiment, 
the MTSP is isolated from a target cell as a means for then identifying 
agents that are potentially specific for the target cell. 
5 In still another embodiment, The test substance is a therapeutic 

compound, and whereby a difference of the MTSP activity measured in 
the presence and in the absence of the test substance indicates that the 
target cell responds to the therapeutic compound. 

One method include the steps of (a) contacting the MTSP protein or 

10 protease domain thereof with one or a plurality of test compounds under 
conditions conducive to interaction between the ligand and the 
compounds; and (b) identifying one or more compounds in the plurality 
that specifically binds to the ligand. 

Another method provided herein includes the steps of a) contacting 

15 an MTSP protein or protease domain thereof with a substrate of the 

MTSP protein, and detecting the proteolysis of the substrate, whereby the 
activity of the MTSP protein is assessed; b) contacting the MTSP protein 
with a substrate of the MTSP protein in the presence of a test substance, 
and detecting the proteolysis of the substrate, whereby the activity of the 

20 MTSP protein is assessed; and c) comparing the activity of the MTSP 

protein assessed in steps a) and b), whereby the activity measured in step 
a) differs from the activity measured in step b) indicates that the test 
substance modulates the activity of the MTSP protein. 

In another embodiment, a plurality of the test substances are 

25 screened simultaneously. In comparing the activity of an MTSP protein in 
the presence and absence of a test substance to assess whether the test 
substance is a modulator of the MTSP protein, it is unnecessary to assay 
the activity in parallel, although such parallel measurement is preferred. It 
is possible to measure the activity of the MTSP protein at one time point 

30 and compare the measured activity to a historical value of the activity of 
the MTSP protein. 
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For instance, one can measure the activity of the MTSP protein in 
the presence of a test substance and compare with historical value of the 
activity of the MTSP protein measured previously in the absence of the 
test substance, and vice versa. This can be accomplished, for example, 
5 by providing the activity of the MTSP protein on an insert or pamphlet 
provided with a kit for conducting the assay. 

Methods for selecting substrates for a particular MTSP are 
described in the EXAMPLES, and particular proteolytic assays are 
exemplified. 

10 Combinations and kits containing the combinations optionally 

including instructions for performing the assays are provided. The 
combinations include an MTSP protein and a substrate of the MTSP 
protein to be assayed; and, optionally reagents for detecting proteolysis 
of the substrate. The substrates, which are typically proteins subject to 

15 proteolysis by a particular MTSP protein, can be identified empirically by 
testing the ability of the MTSP protein to cleave the test substrate. 
Substrates that are cleaved most effectively (i.e., at the lowest 
concentrations and/or fastest rate or under desirable conditions), are 
identified. 

20 Additionally provided herein is a kit containing the above-described 

combination. Preferably, the kit further includes instructions for 
identifying a modulator of the activity of an MTSP protein. Any MTSP 
protein is contemplated as target for identifying modulators of the activity 
thereof. 

25 2. Binding assays 

Also provided herein are methods for identification and isolation of 
agents, particularly compounds that bind to MTSPs. The assays are 
designed to identify agents that bind to the zymogen form, the single 
chain isolated protease domain (or a protein, other than an MTSP protein, 
30 that contains the protease domain of an MTSP protein), and to the 

activated form, including the activated form derived from the full length 
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zymogen or from an extended protease domain. The identified 
compounds are candidates or leads for identification of compounds for 
treatments of tumors and other disorders and diseases involving aberrant 
angiogenesis. The MTSP proteins used in the methods include any 
5 MTSP protein as defined herein, and preferably use MTSP single chain 
domain or proteolytically active portion thereof. 

A variety of methods are provided herein. These methods may be 
performed in solution or in solid phase reactions in which the MTSP 
protein{s) or protease domain(s) thereof are linked, either directly or 
10 indirectly via a linker, to a solid support. Screening assays are described 
in the Examples, and these assays have been used to identify candidate 
compounds. 

For purposes herein, all binding assays described above are 
provided for MTSP3, MTSP4 and MTSP6 (CHECK REF). For MTSP1 
15 (including any variant thereof) and other such proteases, binding assays 
that employ the isolated single chain protease domain or a protein 
containing such domain (other than the MTSP from which the protease is 

derived) are provided. 

Methods for identifying an agent, such as a compound, that 

20 specifically binds to an MTSP single chain protease domain or an MTSP, 
such as an MTSP3, MTSP4 or an MTSP6, are provided herein. The 
method can be practiced by (a) contacting the MTSP with one or a 
plurality of test agents under conditions conducive to binding between the 
MTSP and an agent; and (b) identifying one or more agents within the 

25 plurality that specifically binds to the MTSP. 

For example, in practicing such methods the MTSP polypeptide is mixed 
with a potential binding partner or an extract or fraction of a cell under 
conditions that allow the association of potential binding partners with the 
polypeptide. After mixing, peptides, polypeptides, proteins or other 

30 molecules that have become associated with an MTSP are separated from 
the mixture. The binding partner that bound to the MTSP can then be 
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removed and further analyzed. To identify and isolate a binding partner, 
the entire protein, for instance the entire disclosed protein of SEQ ID Nos. 
6, 8 10 or 12 can be used. Alternatively, a fragment of the protein can 
be used. 

5 A variety of methods can be used to obtain cell extracts. Cells can 

be disrupted using either physical or chemical disruption methods. 
Examples of physical disruption methods include, but are not limited to, 
sonication and mechanical shearing. Examples of chemical lysis methods 
include, but are not limited to, detergent lysis and enzyme lysis. A skilled 

10 artisan can readily adapt methods for preparing cellular extracts in order 
to obtain extracts for use in the present methods. 

Once an extract of a cell is prepared, the extract is mixed with the 
MTSP under conditions in which association of the protein with the 
binding partner can occur. A variety of conditions can be used, the most 

15 preferred being conditions that closely resemble conditions found in the 
cytoplasm of a human cell. Features such as osmolarity, pH, 
temperature, and the concentration of cellular extract used, can be varied 
to optimize the association of the protein with the binding partner. 

After mixing under appropriate conditions, the bound complex is 

20 separated from the mixture. A variety of techniques can be used to 

separate the mixture. For example, antibodies specific to an MTSP can 
be used to immunoprecipitate the binding partner complex. Alternatively, 
standard chemical separation techniques such as chromatography and 
density/sediment centrifugation can be used. 

25 After removing the non-associated cellular constituents in the 

extract, the binding partner can be dissociated from the complex using 
conventional methods. For example, dissociation can be accomplished by 
altering the salt concentration or pH of the mixture. 

To aid in separating associated binding partner pairs from the mixed 

30 extract, the MTSP can be immobilized on a solid support- For example, 
the protein can be attached to a nitrocellulose matrix or acrylic beads. 
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Attachment of the protein or a fragment thereof to a solid support aids in 
separating peptide/binding partner pairs from other constituents found in 
the extract. The identified binding partners can be either a single protein 
or a complex made up of two or more proteins. 

Alternatively, the nucleic acid molecules encoding the single chain 
proteases can be used in a yeast two-hybrid system. The yeast two- 
hybrid system has been used to identify other protein partner pairs and 
can readily be adapted to employ the nucleic acid molecules herein 
described. 

Another in vitro binding assay, particularly for an MTSP3, MTSP4 
or an MTSP6 uses a mixture of a polypeptide that contains at least the 
catalytic domain of one of these proteins and one or more candidate 
binding targets or substrates. After incubating the mixture under 
appropriate conditions, one determines whether the MTSP or a 
polypeptide fragment thereof containing the catalytic domain binds with 
the candidate substrate. For cell-free binding assays, one of the 
components includes or is coupled to a detectable label. The label may 
provide for direct detection, such as radioactivity, luminescence, optical 
or electron density, etc, or indirect detection such as an epitope tag, an 
enzyme, etc, A variety of methods may be employed to detect the label 
depending on the nature of the label and other assay components. For 
example, the label may be detected bound to the solid substrate or a 
portion of the bound complex containing the label may be separated from 
the solid substrate, and the label thereafter detected. 

3. Detection of signal transduction 

The cell surface location of the MTSPs suggests a role for some or 
all of these proteins in signal transduction. Assays for assessing signal 
transduction are well known to those of skill in the art, and may be 
adapted for use with the MTSP protein. 

Assays for identifying agents that effect or alter signal transduction 
mediated by an MTSP, particularly the full length or a sufficient portion to 
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anchor the extracellular domain or a function portion thereof of an MTSP 

on the surface of a cell are provided. Such assays, include, for example, 

transcription based assays in which modulation of a transduced signal is 

assessed by detecting an effect on an expression from a reporter gene 

5 (see, e.g., U.S. Patent No. 5,436,128). 

4. Methods for Identifying Agents that Modulate the 

Expression a Nucleic Acid Encoding an MTSP, particularly an 
MTSP3, MTSP4 or MTSP6 

Another embodiment provides methods for identifying agents that 

10 modulate the expression of a nucleic acid encoding an MTSP, particularly 
an MTSP3, MTSP4 or MTSP. Such assays use any available means of 
monitoring for changes in the expression level of the nucleic acids 
encoding an MTSP, such as MTSP3 or MTSP4. 

In one assay format, cell lines that contain reporter gene fusions 

15 between the open reading frame of MTSP3, MTSP4 or MTSP6 or a 

domain thereof, particularly the protease domain and any assayabie fusion 
partner may be prepared. Numerous assayabie fusion partners are known 
and readily available including the firefly luciferase gene and the gene 
encoding chloramphenicol acetyltransferase (Alam et al.. Anal. Biochem, 

20 188: 245-54 (1990)). Cell lines containing the reporter gene fusions are 
then exposed to the agent to be tested under appropriate conditions and 
time. Differential expression of the reporter gene between samples 
exposed to the agent and control samples identifies agents which 
modulate the expression of a nucleic acid encoding an MTSP3, MTSP4 or 

25 MTSP6. 

Additional assay formats may be used to monitor the ability of the 
agent to modulate the expression of a nucleic acid encoding an MTSP3, 
MTSP4 or MTSP6. For instance, mRNA expression may be monitored 
directly by hybridization to the nucleic acids. Cell lines are exposed to the 
30 agent to be tested under appropriate conditions and time and total RNA or 
mRNA is isolated by standard procedures (see, e.g., Sambrook et al. 
(1989) MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed. Cold 
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Spring Harbor Laboratory Press). Probes to detect differences in RNA 
expression levels between cells exposed to the agent and control cells 
may be prepared from the nucleic acids. It is preferable, but not 
necessary, to design probes which hybridize only with target nucleic acids 
5 under conditions of high stringency. Only highly complementary nucleic 
acid hybrids form under conditions of high stringency. Accordingly, the 
stringency of the assay conditions determines the amount of 
complementarity which should exist between two nucleic acid strands in 
order to form a hybrid. Stringency should be chosen to maximize the 

10 difference in stability between the probe:target hybrid and potential 
probe:non-target hybrids. 

Probes may be designed from the nucleic acids through methods 
known in the art. For instance, the G + C content of the probe and the 
probe length can affect probe binding to its target sequence. Methods to 

15 optimize probe specificity are commonly available (see, e.g., Sambrook et 
aL (1989) MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed. 
Cold Spring Harbor Laboratory Press); and Ausubel et al. (1995) 
CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, Greene Publishing 
Co., NY). 

20 Hybridization conditions are modified using known methods (see, 

e,g.r Sambrook etal. (1989) MOLECULAR CLONING: A LABORATORY 
MANUAL, 2nd Ed. Cold Spring Harbor Laboratory Press); and Ausubel et 
al. (1995) CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, Greene 
Publishing Co., NY), as required for each probe. Hybridization of total 

25 cellular RNA or RNA enriched for polyA RNA can be accomplished in any 
available format. For instance, total cellular RNA or RNA enriched for 
polyA RNA can be affixed to a solid support, and the solid support 
exposed to at least one probe comprising at least one, or part of one of 
the nucleic acid molecules under conditions in which the probe will 

30 specifically hybridize. Alternatively, nucleic acid fragments comprising at 
least one, or part of one of the sequences can be affixed to a solid 
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support, such as a porous glass wafer. The glass wafer can then be 
exposed to total cellular RNA or polyA RNA from a sample under 
conditions in which the affixed sequences will specifically hybridize. 
Such glass wafers and hybridization methods are widely available, for 
5 example, those disclosed by Beattie (WO 95/1 1755). By examining for 
the ability of a given probe to specifically hybridize to an RNA sample 
from an untreated cell population and from a cell population exposed to 
the agent, agents which up or down regulate the expression of a nucleic 
acid encoding the protein having the sequence of SEQ ID N0:3 or SEQ ID 

10 N0:4 are identified. 

5. Methods for Identifying Agents that Modulate at Least One 
Activity of an MTPS, such as MTSP3, MTSP4 or MTSP6 

Methods for identifying agents that modulate at least one activity 

of a an MTSP, such as an MTSP3, MTSP4 or MTSP6 are provided. Such 

15 methods or assays may use any means of monitoring or detecting the 

desired activity. 

In one format, the relative amounts of a protein between a cell 
population that has been exposed to the agent to be tested compared to 
an un-exposed control cell population may be assayed {e.g,, a prostate 

20 cancer cell line, a lung cancer cell line, a colon cancer cell line or a breast 
cancer cell line). In this format, probes, such as specific antibodies, are 
used to monitor the differential expression of the protein in the different 
cell populations. Cell lines or populations are exposed to the agent to be 
tested under appropriate conditions and time. Cellular lysates may be 

25 prepared from the exposed cell line or population and a control, 

unexposed cell line or population. The cellular lysates are then analyzed 
with the probe. 

For example, N- and C- terminal fragments of the MTSP can be 
expressed in bacteria and used to search for proteins which bind to these 
30 fragments. Fusion proteins, such as His-tag or GST fusion to the N- or C- 
terminal regions of the MTSP, such as an MTSP3, MTSP4 or an MTSP6, 
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can be prepared for use as a substrate. These fusion proteins can be 
coupled to, for example, Glutathione-Sepharose beads and then probed 
with cell lysates. Prior to lysis, the cells may be treated with a candidate 
agent which may modulate an MTSP, such as an MTSPS, MTSP4 or an 
5 MTSP6, or proteins that interact with domains thereon. Lysate proteins 
binding to the fusion proteins can be resolved by SDS-PAGE, isolated and 
identified by protein sequencing or mass spectroscopy, as is known in the 
art. 

Antibody probes are prepared by immunizing suitable mammalian 

10 hosts in appropriate immunization protocols using the peptides, 

polypeptides or proteins if they are of sufficient length {e.g,, 4, 5, 6, 7, 8, 
9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 35, 40 or more consecutive amino 
acids the MTSP protein, such as an MTSP3, an MTSP4 or an MTSP6), or 
if required to enhance immunogenicity, conjugated to suitable carriers. 

15 Methods for preparing immunogenic conjugates with carriers, such as 
bovine serum albumin (BSA), keyhole limpet hemocyanin (KLH), or other 
carrier proteins are well known in the art. In some circumstances, direct 
conjugation using, for example, carbodiimide reagents may be effective; 
in other instances linking reagents such as those supplied by Pierce 

20 Chemical Co., Rockford, IL, may be desirable to provide accessibility to 
the hapten. Hapten peptides can be extended at either the amino or 
carboxy terminus with a Cys residue or interspersed with cysteine 
residues, for example, to facilitate linking to a carrier. Administration of 
the immunogens is conducted generally by injection over a suitable time 

25 period and with use of suitable adjuvants, as is generally understood in 
the art. During the immunization schedule, titers of antibodies are taken 
to determine adequacy of antibody formation. 

Anti-peptide antibodies can be generated using synthetic peptides 
corresponding to, for example, the carboxy terminal amino acids of the 

30 MTSP. Synthetic peptides can be as small as 1-3 amino acids in length, 
but are preferably at least 4 or more amino acid residues long. The 
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peptides can be coupled to KLH using standard methods and can be 
immunized into animals, such as rabbits or ungulate. Polyclonal 
antibodies can then be purified, for example using Actigel beads 
containing the covalently bound peptide. 
5 While the polyclonal antisera produced in this way may be 

satisfactory for some applications, for pharmaceutical compositions, use 
of monoclonal preparations is preferred. Immortalized cell lines which 
secrete the desired monoclonal antibodies may be prepared using the 
standard method of Kohler ef a/., (Nature 256: 495-7 (1975)) or 

10 modifications which effect immortalization of lymphocytes or spleen cells, 
as is generally known. The immortalized cell lines secreting the desired 
antibodies are screened by immunoassay in which the antigen is the 
peptide hapten, polypeptide or protein. When the appropriate 
immortalized cell culture secreting the desired antibody is identified, the 

15 cells can be cultured either in vitro or by production in vivo via ascites 
fluid. Of particular interest, are monoclonal antibodies that recognize the 
catalytic domain of the an MTSP, such as an MTSP3, MTSP4 or an 
MTSP6. 

Additionally, the zymogen or two-chain forms the MTSP can be 
20 used to make monoclonal antibodies which recognize conformation 

epitopes. For peptide-directed monoclonal antibodies, peptides from the 
CIr/CIs domain can be used to generate anti-Clr/CIs domain monoclonal 
antibodies which can thereby block activation of the zymogen to the two- 
chain form of the MTSP. This domain can similarly be the substrate for 
25 other non-antibody compounds which bind to these preferred domains on 
either the single-chain or double-chain forms of the MTSP3, MTSP4 or 
MTSP6, and thereby modulate the activity of thereof or prevent its 
activation. 

The desired monoclonal antibodies are then recovered from the 
30 culture supernatant or from the ascites supernatant. Fragments of the 

monoclonals or the polyclonal antisera which contain the immunologically 
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significant portion can be used as antagonists, as well as the intact 
antibodies. Use of innmunologically reactive fragnnents, such as the Fab, 
Fab', of F{ab')2 fragments are often preferable, especially in a therapeutic 
context, as these fragments are generally less immunogenic than the 
5 whole immunoglobulin. 

The antibodies or fragments may also be produced. Regions that 
bind specifically to the desired regions of receptor can also be produced in 
the context of chimeras with multiple species origin. 

Agents that are assayed in the above method can be randomly 
10 selected or rationally selected or designed. 

The agents can be, as examples, peptides, small molecules, and 
carbohydrates. A skilled artisan can readily recognize that there is no 
limit as to the structural nature of the agents. 

The peptide agents can be prepared using standard solid phase (or 
15 solution phase) peptide synthesis methods, as is known in the art. In 
addition, the DNA encoding these peptides may be synthesized using 
commercially available oligonucleotide synthesis instrumentation and 
produced recombinantly using standard recombinant production systems. 
The production using solid phase peptide synthesis is necessitated if non- 
20 gene-encoded amino acids are to be included. 

G- Assay formats and selection of test substances 

A variety of formats and detection protocols are known for 
performing screening assays. Any such formats and protocols may be 
adapted for identifying modulators of MTSP protein activities. The 
25 following includes a discussion of exemplary protocols. 

1 . High throughput screening assays 

Although the above-described assay can be conducted where a 
single MTSP protein is screened, and/or a single test substance is 
screened for in one assay, the assay is preferably conducted in a high 
30 throughput screening mode, i.e., a plurality of the MTSP proteins are 

screened against and/or a plurality of the test substances are screened for 
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simultaneously (See generally. High Throughput Screening: The Discovery 
of Bioactive Substances (Devlin, Ed.) Marcel Dekker, 1997; Sittampalam 
et al., Curr. Opin, Chem, BioL, 1(31:384-91 (1997); and Silverman et a!., 
Curr, Opin. Chem, Biol., 2(3) :397-403 (1998)). For example, the assay 
5 can be conducted in a multi-well (e.g., 24-, 48-, 96-, or 384-well), chip or 
array format. 

High-throughput screening (HTS) is the process of testing a large 
number of diverse chemical structures against disease targets to identify 
"hits" (Sittampalam et al., Curr. Opin. Chem. Biol., 1(3) :384-91 (1997)). 

10 Current state-of-the-art HTS operations are highly automated and 

computerized to handle sample preparation, assay procedures and the 
subsequent processing of large volumes of data. 

Detection technologies employed in high-throughput screens 
depend on the type of biochemical pathway being investigated 

15 (Sittampalam et aL, Curr, Opin. Chem. BioL, 1(3) :384-91 (1997)). These 
methods include, radiochemical methods, such as the scintillation 
proximity assays (SPA), which can be adapted to a variety of enzyme 
assays (Lerner et al., J. BiomoL Screening, 1:135-143 (1996); Baker et 
aL, Anal. Biochem., 239 :20-24 (1996); Baum et al.. Anal. Biochem., 

20 237:129-134 (1996); and Sullivan et aL, J. BiomoL Screening, 2:19-23 
(1997)) and protein-protein interaction assays (Braunwalder et aL, J. 
BiomoL Screening, 1:23-26 (1996); Sonatore et aL, Anal. Biochem., 
240:289-297 (1996); and Chen et aL, J. Biol. Chem., 271 :25308-25315 
(1996)), and non-isotopic detection methods, including but are not limited 

25 to, colorimetric and luminescence detection methods, resonance energy 
transfer (RET) methods, time-resolved fluorescence (HTRF) methods, cell- 
based fluorescence assays, such as fluorescence resonance energy 
transfer (FRET) procedures (see, e.g^,, Gonzalez et a!., Biophys. J., 
69:1272-1280 (1995)), fluorescence polarization or anisotropy methods 

30 (see, e,g., Jameson et aL, Methods EnzymoL, 246 :283-300 (1995); 
Jolley, J. BiomoL Screening, 1:33-38 (1996); Lynch et aL, Anal. 
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Biochem,, 247 :77-82 (1997)), fluorescence correlation spectroscopy 
(FCS) and other such methods. 

2. Test Substances 
Test compounds, including small molecules and libraries and 
5 collections thereof can be screened in the above-described assays and 
assays described below to identify compounds that modulate the activity 
an MTSP protein. Rational drug design methodologies that rely on 
computational chemistry may be used to screen and identify candidate 
compounds. 

10 The compounds identified by the screening methods include 

inhibitors, including antagonists, and may be agonists Compounds for 
screening are any compounds and collections of compounds available, 
know or that can be prepared. 

a. Selection of Compounds 

15 Compounds can be selected for their potency and selectivity of 

inhibition of serine proteases, especially MTSP protein. As described 
herein, and as generally known, a target serine protease and its substrate 
are combined under assay conditions permitting reaction of the protease 
with its substrate. The assay is performed in the absence of test 

20 compound, and in the presence of increasing concentrations of the test 
compound. The concentration of test compound at which 50% of the 
serine protease activity is inhibited by the test compound is the iCgo value 
(Inhibitory Concentration) or EC50 (Effective Concentration) value for that 
compound. Within a series or group of test compounds, those having 

25 lower IC50 or EC50 values are considered more potent inhibitors of the 

serine protease than those compounds having higher IC50 or EC50 values. 
The IC50 measurement is often used for more simplistic assays, whereas 
the EC50 is often used for more complicated assays, such as those 
employing cells. 

30 Preferred compounds according to this aspect have an IC50 value of 

100 nM or less as measured in an in vitro assay for inhibition of MTSP 
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protein activity. Especially preferred compounds have an IC50 value of 
less than 100 nM. 

The test compounds also are evaluated for selectivity toward a 
serine protease. As described herein, and as generally known, a test 
5 compound is assayed for its potency toward a panel of serine proteases 
and other enzymes and an IC50 value or EC50 value is determined for each 
test compound in each assay system. A compound that demonstrates a 
low IC50 value or EC50 value for the target enzyme, e,g., MTSP protein, 
and a higher IC50 value or EC50 value for other enzymes within the test 

10 panel (e,g., urokinase tissue plasminogen activator, thrombin, Factor Xa), 
is considered to be selective toward the target enzyme. Generally, a 
compound is deemed selective if its IC50 value or EC50 value in the target 
enzyme assay is at least one order of magnitude less than the next 
smallest IC50 value or EC50 value measured in the selectivity panel of 

15 enzymes. 

Presently preferred compounds have an IC50 value of 100 nM or 
less as measured in an in vitro assay for inhibition of urokinase activity. 
Especially preferred compounds have an IC50 value in the in vitro 
urokinase inhibition assay that is at least one order of magnitude smaller 

20 than the IC50 value measured in the in vitro tPA inhibition assay. 

Compounds having a selectivity ratio of IC50 u-PA assay: IC50 MTSP 
protein assay of greater than 100 are especially preferred. 

Compounds are also evaluated for their activity in vivo. The type 
of assay chosen for evaluation of test compounds will depend on the 

25 pathological condition to be treated or prevented by use of the 

compound, as well as the route of administration to be evaluated for the 
test compound. 

For instance, to evaluate the activity of a compound to reduce 
tumor growth through inhibition of MTSP protein, the procedures 
30 described by Jankun et al.. Cane, Res,, 57:559-563 (1997) to evaluate 
PAI-1 can be employed. Briefly, the ATCC cell lines DU145 and LnCaP 
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are injected into SCID mice. After tumors are established, the mice are 
given test compound according to a dosing regime determined from the 
compound's in vitro characteristics. The Jankun et al. compound was 
administered in water. Tumor volume measurements are taken twice a 
5 week for about five weeks. A compound is deemed active if an animal to 
which the compound was administered exhibited decreased tumor 
volume, as compared to animals receiving appropriate control compounds. 

Another in vivo experimental model designed to evaluate the effect 
of p-aminobenzamidine, a swine protease inhibitor, on reducing tumor 
10 volume is described by Billstrom et a!.. Int. J. Cancer, 61:542-547 
(1995), 

To evaluate the ability of a compound to reduce the occurrence of, 
or inhibit, metastasis, the procedures described by Kobayashi et al., Int. 
J. Cane, 57:727-733d (1994) can be employed. Briefly, a murein 

15 xenograft selected for high lung colonization potential in injected into 
C57B1/6 mice i.v. (experimental metastasis) or s.c. into the abdominal 
wall (spontaneous metastasis). Various concentrations of the compound 
to be tested can be admixed with the tumor cells in Matrigel prior to 
injection. Daily i.p. injections of the test compound are made either on 

20 days 1-6 or days 7-13 after tumor inoculation. The animals are sacrificed 
about three or four weeks after tumor inoculation, and the lung tumor 
colonies are counted. Evaluation of the resulting data permits a 
determination as to efficacy of the test compound, optimal dosing and 
route of administration, 

25 The activity of the tested compounds toward decreasing tumor 

volume and metastasis can be evaluated in model described in Rabbani et 
al.. Int. J. Cancer 63:840-845 (1995) to evaluate their inhibitor. There, 
Mat LyLu tumor ceils were injected into the flank of Copenhagen rats. 
The animals were implanted with osmotic minipumps to continuously 

30 administer various doses of test compound for up to three weeks. The 
tumor mass and volume of experimental and control animals were 
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evaluated during the experiment, as were metastatic growths. Evaluation 
of the resulting data permits a determination as to efficacy of the test 
compound, optimal dosing, and route of administration. Some of these 
authors described a related protocol in Xing et a!.. Cane. Res., 57:3585- 
5 3593 (1997). 

To evaluate the inhibitory activity of a compound, a rabbit cornea 
neovascularization model can be employed. Avery et al.. Arch, 
Ophthalmol., 108 :1474-1475 (1990) describe anesthetizing New Zealand 
albino rabbits and then making a central corneal incision and forming a 

10 radial corneal pocket. A slow release prostaglandin pellet was placed in 
the pocket to induce neovascularization. Test compound was 
administered i.p. for five days, at which time the animals were sacrificed. 
The effect of the test compound is evaluated by review of periodic 
photographs taken of the limbus, which can be used to calculate the area 

15 of neovascular response and, therefore, limbal neovascularization. A 
decreased area of neovascularization as compared with appropriate 
controls indicates the test compound was effective at decreasing or 
inhibiting neovascularization. 

As angiogenesis model used to evaluate the effect of a test 

20 compound in preventing angiogenesis is described by Min et al.. Cane. 
Res., 56:2428-2433 (1996). C57BL6 mice receive subcutaneous 
injections of a Matrigel mixture containing bFGF, as the angiogenesis- 
inducing agent, with and without the test compound. After five days, the 
animals are sacrificed and the Matrigel plugs, in which neovascularization 

25 can be visualized, are photographed. An experimental animal receiving 
Matrigel and an effective dose of test compound will exhibit less 
vascularization than a control animal or an experimental animal receiving a 
less- or non-effective does of compound. 

An In vivo system designed to test compounds for their ability to 

30 limit the spread of primary tumors is described by Crowley et a!., Proc. 
Natl. Acad. ScL, 90:5021-5025 (1993). Nude mice are injected with 
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tumor cells (PCS) engineered to express CAT (chloramphenicol 
acetyltransf erase). Compounds to be tested for their ability to decrease 
tumor size and/or metastases are administered to the animals, and 
subsequent measurements of tumor size and/or metastatic growths are 
5 made. In addition, the level of CAT detected in various organs provides 
an indication of the ability of the test compound to inhibit metastasis; 
detection of less CAT in tissues of a treated animal versus a control 
animal indicates less CAT-expressing cells migrated to that tissue. 

In vivo experimental modes designed to evaluate the inhibitory 

10 potential of a test serine protease inhibitors, using a tumor cell line F3II, 
the to be highly invasive, are described by Alonso et al.. Breast Cane, 
Res. Treat, 40:209-223 (1996). This group describes in vivo studies for 
toxicity determination, tumor growth, invasiveness, spontaneous 
metastasis, experimental lung metastasis, and an angiogenesis assay. 

15 The CAM model (chick embryo chorioallantoic membrane model), 

first described by L. Ossowski in 1998 (J, Cell Biol., 107 :2437-2445 
(1988)), provides another method for evaluating the urokinase inhibitory 
activity of a test compound. In the CAM model, tumor cells invade 
through the chorioallantoic membrane containing CAM with tumor cells in 

20 the presence of several serine protease inhibitors results in less or no 

invasion of the tumor cells through the membrane. Thus, the CAM assay 
is performed with CAM and tumor cells in the presence and absence of 
various concentrations of test compound. The invasiveness of tumor 
cells is measured under such conditions to provide an indication of the 

25 compound's inhibitory activity. A compound having inhibitory activity 
correlates with less tumor invasion. 

The CAM model is also used in a standard assay of angiogenesis 
(i.e., effect on formation of new blood vessels (Brooks et al.. Methods in 
Molecular Biology, 129 :257-269 (1999)). According to this model, a 

30 filter disc containing an angiogenesis inducer, such as basic fibroblast 
growth factor (bFDG) is placed onto the CAM. Diffusion of the cytokine 
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into the CAM induces local angiogenesis, which may be measured in 
several ways such as by counting the number of blood vessel branch 
points within the CAM directly below the filter disc. The ability of 
identified compounds to inhibit cytokine-induced angiogenesis can be 
5 tested using this model. A test compound can either be added to the 

filter disc that contains the angiogenesis inducer, be placed directly on the 
membrane or be administered systemically. The extent of new blood 
vessel formation in the presence and/or absence of test compound can be 
compared using this model. The formation of fewer new blood vessels in 

10 the presence of a test compound would be indicative of anti-angiogenesis 
activity. Demonstration of anti-angiogenesis activity for inhibitors of an 
MTSP protein indicates a role in angiogenesis for that MTSP protein. 

b. Known serine protease inhibitors 
Compounds for screening can be serine protease inhibitors, which 

15 can be tested for their ability to inhibit the activity of an MTSP, 
particularly an MTSP3, MTSP4, or MTSP6. 

Exemplary, but not limiting serine proteases, include the following 
known serine protease inhibitors are used in the screening assays: Serine 
Protease Inhibitor 3 (SPI-S) (Chen, M.C, et al,, Citokine, n(n) :856-862 

20 (1999)); Aprotinin (lijima, R., et a!., J. Biochem. (Tokyo), 1 26(5) :91 2-91 6 
(1999)); Kazai-type serine protease inhibitor-like proteins (Niimi, T., et a!., 
Eur. J. Biochem., 266(1 ) :282-292 (1999)); Kunitz-type serine protease 
inhibitor (Ravichandran, S,, et al.. Acta Crystallogr. D. Biol. Crystallogr. , 
551111:1814-1821 (1999)); Tissue factor pathway inhibitor-2/Matrix- 

25 associated serine rotease inhibitor (TFPI-2/MSPI), (Liu, Y., et al., Arcli. 
Biociiem. Biopliys., 370(1) :n2-8 (1999)); Bukunin, (Cui, C.Y., et al., J. 
Invest. Dermatol., 1 1 3(2) :182-8 (1999)); Nafmostat mesilate (Ryo, R., et 
al.. Vox Sang.) 76(4) :241-6 (1999)); TPCK (Huang, Y., et al., Oncogene, 
181231:3431-9 (1999)); A synthetic cotton-bound serine protease 

30 inhibitor (Edwards, J.V., et al.. Wound Repair Regen., 7(2) :106-18 
(1999)); FUT-175 (Sawada, M., et al.. Stroke, 30(3) :644-50 (1999)); 
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Combination of serine protease inhibitor FUT-0175 and thromboxane 
synthetase Inhibitor OKY-046 (Kaminogo, M., et al., Neurol. Med. Chir. 
(Tokyo), 38(1 1) :704-8: discussion 708-9 (1998)); The rat serine protease 
inhibitor 2.1 gene (LeCam, A., et al., Biochem. Biophys. Res. Commun., 
5 253(2) :31 1-4 (1998)); A new intracellular serine protease inhibitor 
expressed in the rat pituitary gland complexes with granzyme B (Hill, 
R.M., et al., FEBS Lett., 440(3) :361-4 (1998)); 3,4-Dichloroisocoumarin 
(Hammed, A., et al., Proc. Sac. Exp. Biol. Med., 219(2) :132-7 (1998)); 
LEX032 (Bains, A.S., et al., Eur. J. Pharmacol., 356(1) :67-72 (1998)); N- 

10 tosyl-L-phenylalanine chloromethyl ketone (Dryjanski, M., et al.. 
Biochemistry, 37(40) :141 51-6 (1998)); Mouse gene for the serine 
protease inhibitor neuroserpin (P1 12) (Berger, P., et al.. Gene, 214(1- 
21:25-33 (1998)); Rat serine protease inhibitor 2.3 gene (Paul, C, et al., 
Eur. J. Biochem., 254(3) :538-46 (1998)); Ecotin (Yang, S.Q., et al., J. 

15 Mol. Biol., 279(4) :945-57 (1998)); A 14 kDa plant-related serine protease 
inhibitor (Roch, P., et al.. Dev. Comp. Immuriol., 22(1) :1-12 (1998)); 
Matrix-associated serine protease inhibitor TFPI-2/33 kDa MSPI (Rao, 
C.N., et al.. Int. J. Cancer, 76(5) :749-56 (1998)); ONO-3403 (Hiwasa, 
T., et al.. Cancer Lett., 126(2) :221-5 (1998)); Bdellastasin (Moser, M., et 

20 a!., Eur. J. Biochem., 253(1 ) :21 2-20 (1998)); Bikunin (Xu, Y., et al., J. 
Mol. Biol., 276(5) :955-66 (1998)); Nafamostat mesilate (Mellgren, K., et 
al., Thromb. Haemost., 79(2) :342-7 (1998)); The growth hormone 
dependent serine protease inhibitor, Spi 2.1 (Maake, C, et al.. 
Endocrinology, 1 38(1 2) :5630-6 (1997)); Growth factor activator inhibitor 

25 type 2, a Kunitz-type serine protease inhibitor (Kawaguchi, T., et al., J. 
Biol. Chem.. 272(44) :27558-64 (1997)); Heat-stable serine protease 
inhibitor protein from ovaries of the desert locust, Schistocerga gregaria 
(Hamdaoui, A., et al., Biochem. Biophys. Res. Commun., 238(2) :357-60 
(1997)); Bikunin, (Delaria, K.A., et al., J. Biol. Chem.. 272(1 8) :1 2209-1 4 

30 (1997)); Human placental bikunin (Marlor, C.W., et al., J. Biol. Chem., 

272(101:12202-8 (1997)); Hepatocyte growth factor activator inhibitor, a 
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novel Kunitz-type serine protease inhibitor (Shimomura, T., et a!., J. Biol. 
Chem., 272(101:6370-6 (1997)); FUT-187, oral serine protease inhibitor, 
(Shiozaki, H., et al., Gan To Kaguku Ryoho, 23(14) : 1971-9 (1996)); 
Extracellular matrix-associated serine protease inhibitors (Mr 33,000, 
5 31,000, and 27,000 (Rao, C.N., et al., Arch. Biochem. Biophys., 
335(11:82-92 (1996)); An irreversible isocoumarin serine protease 
inhibitor (Palencia, D.D., et al., Biol. Reprod., 55(3) :536-42 (1996)); 4-(2- 
aminoethyD-benzenesulfonyl fluoride (AEBSF) (Nakabo, Y., et al., J. 
Leukoc. Biol., 60(31:328-36 (1996)); Neuroserpin (Osterwalder, T., et al., 

10 ElVIBO J., 15(121: 2944-53 (1 996)); Human serine protease inhibitor alpha- 
1 -antitrypsin (Forney, J.R., et al., J. Parasitol.. 82(3) :496-502 (1996)); 
Rat serine protease inhibitor 2.3 (Simar-Blanchet, A.E., et al., Eur. J. 
Biochem., 236(2) :638-48 (1996)); Gebaxate mesilate (parodi, F., et al., J. 
Cardiothorac. Vase. Anesth., 10(2) :235-7 (1996)); Recombinant serine 

15 protease inhibitor, CPTI II (Stankiewicz, M., et al., {Acta Biochim. Pol.. 
43(31:525-9 (1996)); A cysteine-rich serine protease inhibitor (Guamerin 
II) (Kim, D.R., et al., J. Enzym. Inhib., 10(2) :81-91 (1996)); 
Diisopropylfluorophosphate (Lundqvist, H., et al., Inflamm. Res., 
44(1 2) :5 10-7 (1995)); Nexin 1 (Yu, D.W., et al., J. Cell ScL, 108(Pt 

20 121:3867-74 (1995)); LEX032 (Scalia, R., et al.. Shock, 4(41:251-6 
(1995)); Protease nexin I (Houenou, L.J., et al., Proc. Natl. Acad. Sci. 
U.S.A., 92(31:895-9 (1995)); Chymase-directed serine protease inhibitor 
(Woodard S.L., et al., J. Immunol., 1 53(1 1) :501 6-25 (1994)); N-alpha- 
tosyl-L-lysyl-chloromethyl ketone (TLCK) (Bourinbaiar, A.S., et al.. Cell 

25 Immunol., 155(1) :230-6 (1994)); Smpi56 (Ghendler, Y., et al., Exp. 
Parasitol., 78(21:121-31 (1994)); Schistosoma haematobium serine 
protease (Blanton, R.E., et al., Mol. Biochem. Parasitol., 63(11:1-1 1 
(1994)); Spi-1 (Warren, W.C., et al., Mol. Cell Endocrinol., 98(1) :27-32 
(1993)); TAME (Jessop, J. J., et al.. Inflammation, 17(5) :613-31 (1993)); 

30 Antithrombin III (Kalaria, R.N., et al.. Am. J. Pathol., 143(3) :886-93 
(1993)); FOY-305 (Ohkoshi, M., et al.. Anticancer Res. , 13(4) :963-6 
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(1993)); Camostat mesilate (Senda, S., et al.. Intern. Med., 32(4) :350-4 
(1993)); Pigment epithelium-derived factor (Steele, F.R., et al., Proc. Natl. 
Acad. Sci. U.S.A.. 90(4) :1 526-30 (1993)); Antistasin (Holstein, T.W., et 
al., FEBS Lett., 309(3) :288-92 (1992)); The vaccinia virus K2L gene 
5 encodes a serine protease inhibitor (Zhou, J., et a!.. Virology, 1 89(2) :678- 
86 (1992)); Bowman-Birk serine-protease inhibitor (Werner, M.H., et al., 
J. Mol. Biol., 225131:873-89 (1992); FUT-175 (Yanamoto, H., et al.. 
Neurosurgery, 30(3) :358-63 (1992)); FUT-175; (Yanamoto, H., et al.. 
Neurosurgery, 30(3) :351-6. discussion 356-7 (1992)); PAI-I (Yreadwell, 
10 B.V., et al., J. Orthop. Res., 9(3) :309-16 (1991)); 3,4- 

Dichloroisocoumarin (Rusbridge, N.M., et al., FEBS Lett. , 268(1) :133-6 
o (1990)); Alpha 1 -antichymotrypsin (Lindmark, B.E., et al.. Am. Rev. 

fi Respir. Des., 141(4 Pt 1) :884-8 (1990)); P-toluenesulfonyi-L-arginine 

y methyl ester (TAME) (Scuderi, P., J. Immunol., 143(1) :1 68-73 (1989)); 

fi^ 15 Aprotinin (Seto, S., et al.. Adv. Exp. Med. Biol., 2476 :49-64 (1989)); 

Alpha 1 -antichymotrypsin (Abraham, C.R., et al.. Cell, 52(4) :487-501 
= (1988)); Contrapsin (Modha, J., et al.. Parasitology, 96 (Pt 1) :99-109 

nj (1988)); (FOY-305) (Yamauchi, Y., et al., Hiroshima J. Med. Sci., 

y 36(1) :81-7 No abstract available (1987)); Alpha 2-antiplasmin (Holmes, 

n 20 W.E., et al., J. Biol. Chem., 262(4) :1 659-64 (1987)); 3,4- 

dichloroisocoumarin (Harper, J.W., et al.. Biochemistry, 24(8) :1 831 -41 
(1985)); Diisoprophylfluorophosphate (Tsutsui, K., et al., Biochem. 
Biophys. Res. Commun., 123(1) :271-7 (1984)); Gabexate mesilate 
(Hesse, B., et al., Pharmacol. Res. Commun., 16(7) :637-45 (1984)); 
25 Phenyl methyl sulfonyl fluoride (Dufer, J., et al., Scand. J. Haematol., 

32(1) :25-32 (1984)); Aprotinin (Seto, S., et al.. Hypertension, 5(6):893-9 
(1983)); Protease inhibitor CI-2 (McPhalen, C.A., et al., J. Mol. Biol., 
168(2) :445-7 (1983)); Phenylmethylsulfonyl fluoride (Sekar V., et al., 
Biochem. Biophys. Res. Commun., 89(2) :474-8 (1979)); PGE1 (Feinstein, 
30 M.D., et al., Prostaglandine, 1 4(6) :1 075-93 (1977) 

c. Combinatorial libraries and other libraries 
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The source of compounds for the screening assays, can be 
libraries, including, but are not limited to, combinatorial libraries. Methods 
for synthesizing combinatorial libraries and characteristics of such 
combinatorial libraries are known in the art {See generally. Combinatorial 
Libraries: Synthesis, Screening and Application Potential (Cortese Ed.) 
Walter de Gruyter, Inc., 1995; Tietze and Lieb, Curr. Opin, Chem. BioL, 
2(3) :363-71 (1998); Lam, Anticancer Drug Des., 12(3) :145-67 (1997); 
Blaney and Martin, Curr, Opin. Chem. Biol., 1(1) :54-9 (1997); and Schultz 
and Schultz, Biotechnol. Prog., 12(6) :729-43 (1996)). 

Methods and strategies for generating diverse libraries, primarily 
peptide- and nucleotide-based oligomer libraries, have been developed 
using molecular biology methods and/or simultaneous chemical synthesis 
methodologies (see, e,g., Dower et al., Annu. Rep, Med. Chem., 26:271- 
280 (1991); Fodor et al.. Science, 251:767-773 (1991); Jung et al., 
Angew. Chem. Ind. Ed. Engl., 31:367-383 (1992); Zuckerman et al., 
Proc. Natl. Acad. Sci. USA, 89:4505-4509 (1992); Scott et al.. Science, 
249 :386-390 (1990); Devlin et al.. Science, 249:404-406 (1990); Cwiria 
et al., Proc. Natl. Acad. Sci. USA, 87:6378-6382 (1990); and Gallop et 
al., J. Medicinal Chemistry, 37:1233-1251 (1994)). The resulting 
combinatorial libraries potentially contain millions of compounds and that 
can be screened to identify compounds that exhibit a selected activity. 

The libraries fall into roughly three categories: fusion-protein- 
displayed peptide libraries in which random peptides or proteins are 
presented on the surface of phage particles or proteins expressed from 
plasmids; support-bound synthetic chemical libraries in which individual 
compounds or mixtures of compounds are presented on insoluble 
matrices, such as resin beads (see, e.g.. Lam et al.. Nature, 354 :82-84 
(1991)) and cotton supports (see, e.g., Eichler et al.. Biochemistry 
32:11035-11041 (1993)); and methods in which the compounds are 
used in solution (see, e.g., Houghten et al.. Nature, 354 :84-86 (1991); 
Houghten et al., BioTechniques, 313 :412-421 (1992); and Scott et al.. 
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Curr. Opin, BiotechnoL, 5:40-48 (1994)). There are numerous examples 
of synthetic peptide and oligonucleotide combinatorial libraries and there 
are many methods for producing libraries that contain non-peptidic small 
organic molecules. Such libraries can be based on basis set of 
5 monomers that are combined to form mixtures of diverse organic 
molecules or that can be combined to form a library based upon a 
selected pharmacophore monomer. 

Either a random or a deterministic combinatorial library can be 
screened by the presently disclosed and/or claimed screening methods. 

10 In either of these two libraries, each unit of the library is isolated and/or 
immobilized on a solid support. In the deterministic library, one knows a 
priori a particular unit's location on each solid support. In a random 
library, the location of a particular unit is not known a priori although each 
site still contains a single unique unit. Many methods for preparing 

15 libraries are known to those of skill in this art (see, e.g., Geysen et a!., 
Proc. Natl. Acad. Sci. USA, 81:3998-4002 (1984), Houghten et al., Proc. 
Natl. Acad. Sci. USA, 81:5131-5135 (1985)). Combinatorial library 
generated by the any techniques known to those of skill in the art are 
contemplated (see, e.g., Table 1 of Schultz and Schultz, Biotechnol. 

20 Prog., 12(6) :729-43 (1996)) for screening; Bartel etal.. Science, 

261:1411-1418 (1993); Baumbach et aL BioPharm, (Mav) :24-35 (1992); 
Bock et al. Nature, 355 :564-566 (1992); Borman, S., Combinatorial 
chemists focus on samll molecules molecular recognition, and automation, 
Ciiem. Eng. News, 2(12) :29 (1996); Boublik, et aL, Eukaryotic Virus 

25 Display: Engineering the Major Surface Glycoproteins of the Autographa 
California Nuclear Polyhedrosis Virus (ACNPV) for the Presentation of 
Foreign Proteins on the Virus Surface, Bio/Technology, 13:1079-1084 
(1995); Brenner, et al.. Encoded Combinatorial Chemistry, Proc. Natl. 
Acad Sci. U.S.A., 89:5381-5383 (1992); Caflisch, et al., Computational 

30 Combinatorial Chemistry for De Novo Ligand Design: Review and 

Assessment, Perspect. Drug Discovery Des., 3:51-84 (1995); Cheng, et 
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al., Sequence-Selective Peptide Binding with a Peptido-A,B-f/'aA7s-steroidal 
Receptor Selected from an Encoded Combinatorial Library, J. Am. Chem. 
Soc, 118:1813-1814 (1996); Chu, et al.. Affinity Capillary 
Electrophoresis to Identify the Peptide in A Peptide Library that Binds 
5 Most Tightly to Vancomycin, J. Org. Chem., 58:648-652 (1993); 
Clackson, et al.. Making Antibody Fragments Using Phage Display 
Libraries, Nature, 352 :624-628 (1991); Combs, et al.. Protein Structure- 
Based Combinatorial Chemistry: Discovery of Non-Peptide Binding 
Elements to Src SH3 Domain, J, Am, Chem. Soc, 1 18 :287-288 (1996); 
10 Cwiria, et a!.. Peptides On Phage: A Vast Library of Peptides for 

Identifying Ligands, Proc. NatL Acad, Sc'l U.S.A., 87:6378-6382 (1990); 
Ecker, et al,. Combinatorial Drug Discovery: Which Method will Produce 
the Greatest Value, Bio/Technology, 13:351-360 (1995); Ellington, et aL, 
In Vitro Selection of RNA Molecules That Bind Specific Ligands, Nature, 

15 346:818-822 (1990); Ellman, J.A., Variants of Benzodiazephines, J. Am. 
Chem. Soc, 114:10997 (1992); Erickson, et al,. The Proteins; Neurath, 
H., Hill, R.L., Eds.: Academic: New York, 1976; pp. 255-257; Felici, et 
al., J. Mol. Biol., 222:301-310 (1991); Fodor, et a!., Light-Directed, 
Spatially Addressable Parallel Chemical Synthesis, Science, 251:767-773 

20 (1991); Francisco, et al.. Transport and Anchoring of Beta-Lactamase to 
the External Surface of E. Coll., Proc. Natl. Acad. Sci. U.S.A., 89:2713- 
2717 (1992); Georgiou, et al.. Practical Applications of Engineering Gram- 
Negative Bacterial Cell Surfaces, TIBTECH, 11:6-10 (1993); Geysen, et 
al.. Use of peptide synthesis to probe viral antigens for epitopes to a 

25 resolution of a single amino acid, Proc. Natl. Acad. Sci. U.S.A., 81=3998- 
4002 (1984); Glaser, et al.. Antibody Engineering by Condon-Based 
Mutagenesis in a Filamentous Phage Vector System, J. Immunol., 
149:3903-3913 (1992); Gram, et al.. In vitro selection and affinity 
maturation of antibodies from a naive combinatorial immunoglobulin 

30 library, Proc. Natl. Acad. Sci., 89:3576-3580 (1992); Han, et al., Liquid- 
Phase Combinatorial Synthesis, Proc. NatL Acad. Sci. U.S.A., 92:6419- 
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6423 (1995); Hoogenboom, et al., Multi-Subunit Proteins on the Surface 
of Filamentous Phage: Methodologies for Displaying Antibody (Fab) Heavy 
and Light Chains, Nucleic Acids Res,, 19:4133-4137 (1991); Houghten, 
et al., General Method for the Rapid Solid-Phase Synthesis of Large 
5 Numbers of Peptides: Specificity of Antigen-Antibody Interaction at the 
Level of Individual Amino Acids, Proc. Natl. Acad. Sci. U.S. A,, 82:5131- 
5135 (1985); Houghten, et al,. The Use of Synthetic Peptide 
Combinatorial Libraries for the Determination of Peptide Ligands in Radio- 
Receptor Assays-Opiod-Peptides, Bioorg. Med. Chem. Lett., 3:405-412 

10 (1993); Houghten, et al.. Generation and Use of Synthetic Peptide 

Combinatorial Libraries for Basic Research and Drug Discovery, Nature, 
354 :84-86 (1991); Huang, et a!.. Discovery of New Ligand Binding 
Pathways in Myoglobin by Random Mutagenesis, Nature Struct. Biol., 
1:226-229 (1994); Huse, et al.. Generation of a Large Combinatorial 

15 Library of the Immunoglobulin Repertoire In Phage Lambda, Science, 
246:1275-1281 (1989); Janda, K.D., New Strategies for the Design of 
Catalytic Antibodies, BiotechnoL Prog., 6:178-181 (1990); Jung, et al,. 
Multiple Peptide Synthesis Methods and Their Applications, Angew. 
Chem. Int. Ed. EngL, 31:367-486 (1992); Kang, et al„ Linkage of 

20 Recognition and Replication Functions By Assembling Combinatorial 
Antibody Fab Libraries Along Phage Surfaces, Proc. NatL Acad. ScL 
U.S.A., 88:4363-4366 (1991a); Kang, et al.. Antibody Redesign by Chain 
Shuffling from Random Combinatorial Immunoglobulin Libraries, Proc. 
Natl. Acad. Sci. U.S.A., 88:11120-11123 (1991b); Kay, et al.. An Ml 3 

25 Phage Library Displaying Random 38-Amino-Acid-Peptides as a Source of 
Novel Sequences with Affinity to Selected Targets Genes, Gene, 1 28 :59- 
65 (1993); Lam, et al., A new type of synthetic peptide library for 
identifying ligand-binding activity. Nature, 354 :82-84 (1991) (published 
errata apear in Nature, 358 :434 (1992) and Nature, 360 :768 (1992); 

30 Lebl, et al.. One Bead One Structure Combinatorial Libraries, Biopolymers 
fPept. ScL), 37:177-198 (1995); Lerner, et al.. Antibodies without 
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Immunization, Science, 258 :1313-1314 (1992); Li, et al.. Minimization of 
a Polypeptide Hormone, Science, 270 :1657-1660 (1995); Light, et aL, 
Display of Dimeric Bacterial Alkaline Phosphatase on the Major Coat 
Protein of Filamentous Bacteriophage, Bioorg. Med. Chem. Lett., 3:1073- 
5 1079 (1992); Little, et al.. Bacterial Surface Presentation of Proteins and 
Peptides: An Alternative to Phage Technology, Trends Bioteclinol., 1 1 :3-5 
(1993); Marks, et al., By-Passing Immunization. Human Antibodies from 
V-Gene Libraries Displayed on Phage, J. MoL Biol,, 222 :581-597 (1991); 
Matthews, et al.. Substrate Phage: Selection of Protease Substrates by 

10 Monovalent Phage Display, Science, 260 :1 113-1117 (1993); McCafferty, 
et al.. Phage Enzymes: Expression and Affinity Chromatography of 
Functional Alkaline Phosphatase on the Surface of Bacteriophage, Protein 
Eng., 4:955-961 (1991); Menger, et al.. Phosphatase Catalysis Developed 
Via Combinatorial Organic Chemistry, J. Org. Cfiem., 60:6666-6667 

15 (1995); Nicolaou, et a!., Angew. Chem. int. Ed. Engl., 34:2289-2291 
(1995); Oldenburg, et al.. Peptide Ligands for A Sugar-Binding Protein 
Isolated from a Random Peptide Library, Proc. Natl. Acad. Sci. U.S.A., 
89:5393-5397 (1992); Parmley, et al., Antibody-Selectable Filamentous 
fd Phage Vectors: Affinity Purification of Target Genes, Genes, 73:305- 

20 318 (1988); Pinilla, et al.. Synthetic Peptide Combinatorial Libraries 
(SPCLS)-ldentification of the Antigenic Determinant of Beta-Endorphin 
Recognized by Monoclonal Antibody-3E7, Gene, 128 :71-76 (1993); 
Pinilla, et al.. Review of the Utility of Soluble Combinatorial Libraries, 
Biopolymers, 37:221-240 (1995); Pistor, et al.. Expression of Viral 

25 Hemegglutinan On the Surface of E. Coll., Klin. Wochenschr. , 66:1 10- 
1 16 (1989); Pollack, et aL, Selective Chemical Catalysis by an Antibody, 
Science, 234 :1570-1572 (1986); Rigler, et aL, Fluorescence Correlations, 
Single Molecule Detection and Large Number Screening: Applications in 
Biotechnology, J. BiotechnoL, 41:177-186 (1995); Sarvetnick, et aL, 

30 Increasing the Chemical Potential of the Germ-Line Antibody Repertoire, 
Proc. Natl. Acad. Sci. U.S.A., 90:4008-401 1 (1993); Sastry, et aL, 
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Cloning of the Immunological Repertiore in Escherichia Co// for Generation 
of Monoclonal Catalytic Antibodies: Construction of a Heavy Chain 
Variable Region-Specific cDNA Library, Proc, Natl, Acad. Sci. U.S.A., 
86:5728-5732 (1989); Scott, et a!.. Searching for Peptide Ligands with 
5 an Epitope Library, Science, 249 :386-390 (1990); Sears, et al.. 

Engineering Enzymes for Bioorganic Synthesis: Peptide Bond Formation, 
Biotechnol. Prog., 12:423-433 (1996); Simon, et. al., Peptides: A 
Modular Approach to Drug Discovery, Proc. Natl. Acad. Sci. U.S.A., 
89:9367-9371 (1992); Still, et al.. Discovery of Sequence-Selective 

10 Peptide Binding by Synthetic Receptors Using Encoded Combinatorial 
Libraries, Acc. Chem. Res., 29:155-163 (1996); Thompson, et al., 
Synthesis and Applications of Small Molecule Libraries, Chem. Rev., 
96:555-600 (1996); Tramontane, et al.. Catalytic Antibodies, Science, 
234 :1566-1570 (1986); Wrighton, et al.. Small Peptides as Potent 

15 Mimetics of the Protein Hormone Erythropoietin, Science, 273:458-464 
(1996); York, et al.. Combinatorial mutagenesis of the reactive site region 
in plasminogen activator inhibitor I, J. Biol. Chem., 266 :8595-8600 
(1991); Zebedee, et al.. Human Combinatorial Antibody Libraries to 
Hepatitis B Surface Antigen, Proc. Natl. Acad. Sci. U.S.A., 89:3175-3179 

20 (1992); Zuckerman, et a!.. Identification of Highest-Affinity Ligands by 
Affinity Selection from Equimolar Peptide Mixtures Generated by Robotic 
Synthesis, Proc. Natl. Acad. Sci. U.S.A., 89:4505-4509 (1992). 

For example, peptides that bind to an MTSP protein or a protease 
domain of an MTSP protein can be identified using phage display libraries. 

25 In an exemplary embodiment, this method can include a) contacting 

phage from a phage library with the MTSP protein or a protease domain 
thereof; (b) isolating phage that bind to the protein; and (c) determining 
the identity of at least one peptide coded by the isolated phage to identify 
a peptide that binds to an MTSP protein. 
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H. Modulators of the activity of MTSP proteins 

Provided herein are compounds, identified by screening or produced 
using the MTSP proteins or protease domain in other screening methods, 
that modulate the activity of an MTSP. These compounds act by directly 
5 interacting with the MTSP protein or by altering transcription or 
translation thereof. Such molecules include, but are not limited to, 
antibodies that specifically react with an MTSP protein, particularly with 
the protease domain thereof, antisense nucleic acids that alter expression 
of the MTSP protein, antibodies, peptide mimetics and other such 

10 compounds. 

1 . Antibodies 

Antibodies, including polyclonal and monoclonal antibodies, that 
specifically bind to the MTSP proteins provided herein, particularly to the 
single chain protease domains thereof are provided. Preferably, the 

15 antibody is a monoclonal antibody, and preferably, the antibody 
specifically binds to the protease domain of the MTSP protein. In 
particular embodiments, antibodies to each of the single chain of protease 
domain of MTSP1 , MTSP3, MTSP4 and MTSP6 Also provided are 
antibodies that specifically bind to any domain of MTSP3 or MTSP4, and 

20 to double chain forms thereof. 

The MTSP protein and domains, fragments, homologs and 
derivatives thereof may be used as immunogens to generate antibodies 
that specifically bind such immunogens. Such antibodies include but are 
not limited to polyclonal, monoclonal, chimeric, single chain. Fab 

25 fragments, and an Fab expression library. In a specific embodiment, 

antibodies to human MTSP protein are produced. In another embodiment, 
complexes formed from fragments of MTSP protein, which fragments 
contain the serine protease domain, are used as immunogens for antibody 
production. 

30 Various procedures known in the art may be used for the 

production of polyclonal antibodies to MTSP protein, its domains. 
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derivatives, fragments or analogs. For production of the antibody, various 
host animals can be immunized by injection with the native MTSP protein 
or a synthetic version, or a derivative of the foregoing, such as a cross- 
linked MTSP protein. Such host animals include but are not limited to 
5 rabbits, mice, rats, etc. Various adjuvants can be used to increase the 
immunological response, depending on the host species, and include but 
are not limited to Freund's (complete and incomplete), mineral gels such 
as aluminum hydroxide, surface active substances such as lysolecithin, 
pluronic polyols, polyanions, peptides, oil emulsions, dinitrophenol, and 
10 potentially useful human adjuvants such as bacille Caimette-Guerin (BCG) 
and corynebacterium parvum. 

For preparation of monoclonal antibodies directed towards an 
MTSP protein or domains, derivatives, fragments or analogs thereof, any 
technique that provides for the production of antibody molecules by 
15 continuous cell lines in culture may be used. Such techniques include but 
are not restricted to the hybridoma technique originally developed by 
Kohler and Milstein [Nature 256:495-497 (1975)), the trioma technique, 
the human B-cell hybridoma technique (Kozbor et al.. Immunology Today 
4:72 (1983)), and the EBV hybridoma technique to produce human 
20 monoclonal antibodies (Cole et al., in Monoclonal Antibodies and Cancer 
Therapy, Alan R, Liss, Inc., pp. 77-96 (1985)). In an additional 
embodiment, monoclonal antibodies can be produced in germ-free animals 
utilizing recent technology (PCT/US90/02545), Human antibodies may be 
used and can be obtained by using human hybridomas (Cote et al., Proc. 
25 Natl. Acad. Sci. USA 80:2026-2030 (1983)). Or by transforming human 
B cells with EBV virus in vitro (Cole et al., in Monoclonal Antibodies and 
Cancer Therapy, Alan R. Liss, Inc., pp. 77-96 (1985)). Techniques 
developed for the production of "chimeric antibodies" (Morrison et al., 
Proc. Natl. Acad. Sci. USA 81:6851-6855 (1984); Neuberger et al., 
30 Nature 312 :604-608 (1984); Takeda et a!.. Nature 314 :452-454 (1985)) 
by splicing the genes from a mouse antibody molecule specific for the 
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MTSP protein together with genes from a human antibody molecule of 
appropriate biological activity can be used. 

Techniques described for the production of single chain antibodies 
(U.S. patent 4,946,778) can be adapted to produce MTSP protein-specific 
5 single chain antibodies. An additional embodiment uses the techniques 
described for the construction of Fab expression libraries (Huse et al.. 
Science 246 :1 275-1 281 (1989)) to allow rapid and easy identification of 
monoclonal Fab fragments with the desired specificity for MTSP protein or 
MTSP protein, or domains, derivatives, or analogs thereof. Non-human 
10 antibodies can be "humanized" by known methods {see, e.g., U.S. Patent 
No. 5,225,539). 

Antibody fragments that contain the idiotypes of MTSP protein can 
be generated by techniques known in the art. For example, such 
fragments include but are not limited to: the F(ab')2 fragment which can 

15 be produced by pepsin digestion of the antibody molecule; the Fab' 

fragments that can be generated by reducing the disulfide bridges of the 
F(ab')2 fragment, the Fab fragments that can be generated by treating the 
antibody molecular with papain and a reducing agent, and Fv fragments. 
In the production of antibodies, screening for the desired antibody 

20 can be accomplished by techniques known in the art, e.g., ELISA 

(enzyme-linked immunosorbent assay). To select antibodies specific to a 
particular domain of the MTSP protein one may assay generated 
hybridomas for a product that binds to the fragment of the MTSP protein 
that contains such a domain 

25 The foregoing antibodies can be used in methods known in the art 

relating to the localization and/or quantitation of MTSP protein proteins, 
e.g., for imaging these proteins, measuring levels thereof in appropriate 
physiological samples, in diagnostic methods, etc. 

In another embodiment, (see infra), anti-MTSP protein antibodies, 

30 or fragments thereof, containing the binding domain are used as 
therapeutic agents. 
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2. Peptides and Peptide IWimetics 

Provided herein are methods for identifying molecules that bind to 
and modulate the activity of MTSP proteins. Included among molecules 
that bind to MTSPs, particularly the single chain protease domain or 
5 catalytically active fragments thereof, are peptides and peptide mimetics- 
Peptide mimetics are molecules or compounds that mimic the necessary 
molecular conformation of a ligand or polypeptide for specific binding to a 
target molecule such as, e.g., an MTSP protein. In an exemplary 
embodiment, the peptides or peptide mimetics bind to the protease 

10 domain of the MTSP protein. Such peptides and peptide mimetics include 
those of antibodies that specifically bind an MTSP protein and, preferably, 
bind to the protease domain of an MTSP protein. The peptides and 
peptide mimetics identified by methods provided herein can be agonists or 
antagonists of MTSP proteins. 

15 Such peptides and peptide mimetics are useful for diagnosing, 

treating, preventing, and screening for a disease or disorder associated 
with MTSP protein activity in a mammal. In addition, the peptides and 
peptide mimetics are useful for identifying, isolating, and purifying 
molecules or compounds that modulate the activity of an MTSP protein, 

20 or specifically bind to an MTSP protein, preferably, the protease domain 
of an MTSP protein. Low molecular weight peptides and peptide 
mimetics can have strong binding properties to a target molecule, e.g., an 
MTSP protein or, preferably, to the protease domain of an MTSP protein. 
Peptides and peptide mimetics that bind to MTSP proteins as 

25 described herein can be administered to mammals, including humans, to 
modulate MTSP protein activity. Thus, methods for therapeutic treatment 
and prevention of neoplastic diseases comprise administering a peptide or 
peptide mimetic compound in an amount sufficient to modulate such 
activity are provided. Thus, also provided herein are methods for treating 

30 a subject having such a disease or disorder in which a peptide or peptide 
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mimetic compound is administered to the subject in a therapeutically 
effective dose or amount. 

Compositions containing the peptides or peptide mimetics can be 
administered for prophylactic and/or therapeutic treatments. In 
therapeutic applications, compositions can be administered to a patient 
already suffering from a disease, as described above, in an amount 
sufficient to cure or at least partially arrest the symptoms of the disease 
and its complications. Amounts effective for this use will depend on the 
severity of the disease and the weight and general state of the patient. 

In prophylactic applications, compositions containing the peptides 
and peptide mimetics are administered to a patient susceptible to or 
otherwise at risk of a particular disease. Such an amount is defined to be 
a "prophylactically effective dose", in this use, the precise amounts again 
depend on the patient's state of health and weight. 

Accordingly, the peptides and peptide mimetics that bind to an 
MTSP protein can be used generating pharmaceutical compositions 
containing, as an active ingredient, at least one of the peptides or peptide 
mimetics in association with a pharmaceutical carrier or diluent. The 
compounds can be administered, for example, by oral, pulmonary, 
parental (intramuscular, intraperitoneal, intravenous (IV) or subcutaneous 
injection), inhalation {via a fine powder formulation), transdermal, nasal, 
vaginal, rectal, or sublingual routes of administration and can be 
formulated in dosage forms appropriate for each route of administration 
(see, e.g., International PCT application Nos. WO 93/25221 and WO 
94/17784; and European Patent Application 613,683). 

Peptides and peptide mimetics that bind to MTSP proteins are 
useful in vitro as unique tools for understanding the biological role of 
MTSP proteins, including the evaluation of the many factors thought to 
influence, and be influenced by, the production of MTSP protein. Such 
peptides and peptide mimetics are also useful in the development of other 
compounds that bind to and modulate the activity of an MTSP protein. 
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because such compounds provide important information on the 
relationship between structure and activity that should facilitate such 
development. 

The peptides and peptide mimetics are also useful as competitive 
5 binders in assays to screen for new MTSP proteins or MTSP protein 
agonists. In such assay embodiments, the compounds can be used 
without modification or can be modified in a variety of ways; for example, 
by labeling, such as covalentiy or non-covalently joining a moiety which 
directly or indirectly provides a detectable signal. In any of these assays, 

10 the materials thereto can be labeled either directly or indirectly. 

Possibilities for direct labeling include label groups such as: radiolabels 
such as ^^^1 enzymes (U.S. Pat. No. 3,645,090) such as peroxidase and 
alkaline phosphatase, and fluorescent labels {U.S. Pat. No. 3,940,475) 
capable of monitoring the change in fluorescence intensity, wavelength 

15 shift, or fluorescence polarization. Possibilities for indirect labeling include 
biotinylation of one constituent followed by binding to avidin coupled to 
one of the above label groups. The compounds may also include spacers 
or linkers in cases where the compounds are to be attached to a solid 
support. 

20 Moreover, based on their ability to bind to an MTSP protein, the 

peptides and peptide mimetics can be used as reagents for detecting 
MTSP proteins in living cells, fixed cells, in biological fluids, in tissue 
homogenates, in purified, natural biological materials, etc. For example, 
by labelling such peptides and peptide mimetics, one can identify cells 

25 having MTSP proteins. In addition, based on their ability to bind an MTSP 
protein, the peptides and peptide mimetics can be used in situ staining, 
FACS (fluorescence-activated cell sorting), Western blotting, ELISA, etc. 
In addition, based on their ability to bind to an MTSP protein, the peptides 
and peptide mimetics can be used in purification of MTSP protein 

30 polypeptides or in purifying cells expressing the MTSP protein 
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polypeptides, e,g., a polypeptide encoding the protease domain of an 
MTSP protein. 

The peptides and peptide mimetics can also be used as commercial 
reagents for various medical research and diagnostic uses. 
5 The activity of the peptides and peptide mimetics can be evaluated 

either in vitro or in vivo in one of the numerous models described in 
McDonald (1992) Anr), J. of Pediatric Hematology/Oncology, 74:8-21, 
which is incorporated herein by reference. 

3- Peptide and peptide mimetic therapy 

10 Peptides and peptide mimetics that can bind to MTSP proteins or 

the protease domain of MTSP proteins and modulate the activity thereof, 
or have MTSP protein activity, can be used for treatment of neoplastic . 
The peptides and peptide mimetics may be delivered, in vivo or ex vivo, 
to the cells of a subject in need of treatment. Further, peptides which 

15 have MTSP protein activity can be delivered, in vivo or ex vivo, to cells 
which carry mutant or missing alleles encoding the MTSP protein gene. 
Any of the techniques described herein or known to the skilled artisan can 
be used for preparation and in vivo or ex vivo delivery of such peptides 
and peptide mimetics that are substantially free of other human proteins. 

20 For example, the peptides can be readily prepared by expression in a 
microorganism or synthesis in vitro. 

The peptides or peptide mimetics can be introduced into cells, in 
vivo or ex vivo, by microinjection or by use of liposomes, for example. 
Alternatively, the peptides or peptide mimetics may be taken up by cells, 

25 in vivo or ex vivo, actively or by diffusion. In addition, extracellular 

application of the peptide or peptide mimetic may be sufficient to effect 
treatment of a neoplastic disease. Other molecules, such as drugs or 
organic compounds, that: 1) bind to an MTSP protein or protease domain 
thereof; or 2) have a similar function or activity to an MTSP protein or 

30 protease domain thereof, may be used in methods for treatment. 
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4, Rational drug design 

The goal of rational drug design is to produce structural analogs of 
biologically active polypeptides or peptides of interest or of small 
molecules or peptide mimetics with which they interact (e.g., agonists, 
5 antagonists, inhibitors) in order to fashion drugs which are, e.g., more 
active or stable forms thereof; or which, e.g., enhance or interfere with 
the function of a polypeptide in vivo (e.g., an MTSP protein). In one 
approach, one first determines the three-dimensional structure of a protein 
of interest (e.g., an MTSP protein or polypeptide having a protease 

10 domain) or, for example, of a MTSP protein-ligand complex, by X-ray 
crystallography, by computer modeling or most typically, by a 
combination of approaches (see, e.g., Erickson et al. 1990). Also, useful 
information regarding the structure of a polypeptide may be gained by 
modeling based on the structure of homologous proteins. In addition, 

15 peptides can be analyzed by an alanine scan. In this technique, an amino 
acid residue is replaced by Ala, and its effect on the peptide's activity is 
determined. Each of the amino acid residues of the peptide is analyzed in 
this manner to determine the important regions of the peptide. 

Also, a polypeptide or peptide that binds to an MTSP protein or, 

20 preferably, the protease domain of an MTSP protein, can be selected by a 
functional assay, and then the crystal structure of this polypeptide or 
peptide can be determined. The polypeptide can be, for example, an 
antibody specific for an MTSP protein or the protein domain of an MTSP 
protein. This approach can yield a pharmacore upon which subsequent 

25 drug design can be based. Further, it is possible to bypass the 

crystallography altogether by generating anti-idiotypic polypeptides or 
peptides, (anti-ids) to a functional, pharmacologically active polypeptide 
or peptide that binds to an MTSP protein or protease domain of an MTSP 
protein. As a mirror image of a mirror image, the binding site of the 

30 anti-ids is expected to be an analog of the original target molecule, e.g., 
an MTSP protein or polypeptide having an MTSP protein. The anti-id 
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could then be used to identify and isolate peptides from banks of 
chemically or biologically produced banks of peptides. Selected peptides 
would then act as the pharmacore. 

Thus, one may design drugs which have, e.g., improved activity or 
5 stability or which act as modulators (e.g., inhibitors, agonists, 

antagonists, etc.) of MTSP protein activity, and are useful in the methods, 
particularly the methods for diagnosis, treatment, prevention, and 
screening of a neoplastic disease. By virtue of the availability of cloned 
MTSP protein sequences, sufficient amounts of the MTSP protein 

10 polypeptide may be made available to perform such analytical studies as 
X-ray crystallography. In addition, the knowledge of the amino acid 
sequence of an MTSP protein or the protease domain thereof, e.g., the 
protease domain encoded by the amino acid sequence of SEQ ID NO: 2, 
can provide guidance on computer modeling techniques in place of, or in 

15 addition to. X-ray crystallography. 

Methods of identifying peptides and peptide mimetics that 
bind to IVITSP proteins 

Peptides having a binding affinity to the MTSP protein polypeptides 

provided herein {e.g.^ an MTSP protein or a polypeptide having a protease 

20 domain of an MTSP protein) can be readily identified, for example, by 
random peptide diversity generating systems coupled with an affinity 
enrichment process. Specifically, random peptide diversity generating 
systems include the "peptides on plasmids" system (see, e,g., U.S. 
Patent Nos. 5,270,170 and 5,338,665); the "peptides on phage" system 

25 (see, e.g,, U.S. Patent No. 6,121,238 and Cwirla,efaA (1990) Proc. 
Natl. Acad, Sci. U.S.A. 57:6378-6382); the "polysome system;" the 
"encoded synthetic library (ESL)" system; and the "very large scale 
immobilized polymer synthesis" system (see, e.g., U.S. Patent No. 
6,121,238; and Dower et al. (1991) An, Rep. Med. Chem. 25:271-280 

30 For example, using the procedures described above, random 

peptides can generally be designed to have a defined number of amino 
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acid residues in length {e.g., 12). To generate the collection of 
oligonucleotides encoding the random peptides, the codon motif {NNK)x, 
where N is nucleotide A, C, G, or T (equimolar; depending on the 
methodology employed, other nucleotides can be employed), K is G or T 
5 (equimolar), and x is an integer corresponding to the number of amino 
acids in the peptide {e.g., 12) can be used to specify any one of the 32 
possible codons resulting from the NNK motif: 1 for each of 12 amino 
acids, 2 for each of 5 amino acids, 3 for each of 3 amino acids, and only 
one of the three stop codons. Thus, the NNK motif encodes all of the 

10 amino acids, encodes only one stop codon, and reduces codon bias. 

The random peptides can be presented, for example, either on the 
surface of a phage particle, as part of a fusion protein containing either 
the pill or the pVIII coat protein of a phage fd derivative (peptides on 
phage) or as a fusion protein with the Lad peptide fusion protein bound 

15 to a plasmid (peptides on plasmids). The phage or plasmids, including the 
DNA encoding the peptides, can be identified and isolated by an affinity 
enrichment process using immobilized MTSP protein polypeptide having a 
protease domain. The affinity enrichment process, sometimes called 
"panning," typically involves multiple rounds of incubating the phage, 

20 plasmids, or polysomes with the immobilized MTSP protein polypeptide, 
collecting the phage, plasmids, or polysomes that bind to the MTSP 
protein polypeptide (along with the accompanying DNA or mRNA), and 
producing more of the phage or plasmids (along with the accompanying 
Lacl-peptide fusion protein) collected. 

25 Characteristics of peptides and peptide mimetics 

Typically, the molecular weight of preferred peptides or peptide 
mimetics is from about 250 to about 8,000 daltons. If the peptides are 
oligomerized, dimerized and/or derivatized with a hydrophilic polymer 
(e.g., to increase the affinity and/or activity of the compounds), the 

30 molecular weights of such peptides can be substantially greater and can 
range anywhere from about 500 to about 120,000 daltons, more 
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preferably from about 8,000 to about 80,000 daltons. Such peptides can 
comprise 9 or more amino acids wherein the amino acids are naturally 
occurring or synthetic (non-naturally occurring) amino acids. One skilled 
in the art would know how to determine the affinity and molecular weight 
5 of the peptides and peptide mimetics suitable for therapeutic and/or 

diagnostic purposes (e.g., see Dower et aL, U.S. Patent No. 6,121,238). 

The peptides may be covalently attached to one or more of a 
variety of hydrophilic polymers. Suitable hydrophilic polymers include, 
but are not limited to, poiyalkylethers as exemplified by polyethylene 

10 glycol and polypropylene glycol, polylactic acid, polyglycolic acid, 
polyoxyalkenes, polyvinylalcohol, polyvinylpyrrolidone, cellulose and 
cellulose derivatives, dextran and dextran derivatives, etc. When the 
peptide compounds are derivatized with such polymers, their solubility 
and circulation half-lives can be increased with little, if any, diminishment 

15 in their binding activity. The peptide compounds may be dimerized and 
each of the dimeric subunits can be covalently attached to a hydrophilic 
polymer. The peptide compounds can be PEGylated, i.e., covalently 
attached to polyethylene glycol (PEG). 

Peptide analogs are commonly used in the pharmaceutical industry 

20 as non-peptide drugs with properties analogous to those of the template 
peptide. These types of non-peptide compounds are termed "peptide 
mimetics" or "peptidomimetics" (Luthman et aL, A Textbook of Drug 
Design and Development 1 4:386-406 ^ 2nd Ed., Harwood Academic 
Publishers (1996); Joachim Grante (1994) Angew. Chem. Int. Ed. EngL, 

25 55:1699-1720; Fauchere (1986) J, Adv. Drug Res., 75:29; Veber and 
Freidinger (1985) TINS, p. 392; and Evans et aL (1987) J, Med. Chem. 
50:1229). Peptide mimetics that are structurally similar to therapeutically 
useful peptides may be used to produce an equivalent or enhanced 
therapeutic or prophylactic effect. Preparation of peptidomimetics and 

30 structures thereof are known to those of skill in this art. 
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Systematic substitution of one or more amino acids of a consensus 
sequence with a D-amino acid of the same type {e.g., D-lysine in place of 
L-lysine) may be used to generate more stable peptides. In addition, 
constrained peptides containing a consensus sequence or a substantially 
5 identical consensus sequence variation may be generated by methods 
known in the art (Rizo et af. (1992) An. Rev. Biochem., 57:387, 
incorporated herein by reference); for example^ by adding internal 
cysteine residues capable of forming intramolecular disulfide bridges 
which cyclize the peptide. 

10 Those skilled in the art would appreciate that modifications may be 

made to the peptides and mimetics without deieteriously effecting the 
biological or functional activity of the peptide. Further, the skilled artisan 
would know how to design non-peptide structures in three dimensional 
terms, that mimic the peptides that bind to a target molecule, e.g., an 

15 MTSP protein or, preferably, the protease domain of MTSP proteins (see, 
e.g., Eck and Sprang (1989) J. BioL Chem,, 26: 17605-18795). 

When used for diagnostic purposes, the peptides and peptide 
mimetics may be labeled with a detectable label and, accordingly, the 
peptides and peptide mimetics without such a label can serve as 

20 intermediates in the preparation of labeled peptides and peptide mimetics. 
Detectable labels can be molecules or compounds, which when covalently 
attached to the peptides and peptide mimetics, permit detection of the 
peptide and peptide mimetics in vivo, for example, in a patient to whom 
the peptide or peptide mimetic has been administered, or in vitro, e.g., in 

25 a sample or cells. Suitable detectable labels are well known in the art and 
include, by way of example, radioisotopes, fluorescent labels {e.g., 
fluorescein), and the like. The particular detectable label employed is not 
critical and is selected relative to the amount of label to be employed as 
well as the toxicity of the label at the amount of label employed. 

30 Selection of the label relative to such factors is well within the skill of the 
art. 
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Covalent attachment of a detectable label to the peptide or peptide 
mimetic is accomplished by conventional methods well known in the art. 
For example, when the ^^^1 radioisotope is employed as the detectable 
label, covalent attachment of ^^^1 to the peptide or the peptide mimetic 
5 can be achieved by incorporating the amino acid tyrosine into the peptide 
or peptide mimetic and then iodimating the peptide (see, e.g., Weaner et 
al. (1994) Synthesis and Applications of Isotopically Labelled Compounds, 
pp. 137-140), If tyrosine is not present in the peptide or peptide mimetic, 
incorporation of tyrosine to the N or C terminus of the peptide or peptide 

10 mimetic can be achieved by well known chemistry. Likewise, ^^P can be 
incorporated onto the peptide or peptide mimetic as a phosphate moiety 
through, for example, a hydroxyl group on the peptide or peptide mimetic 
using conventional chemistry. 

Labeling of peptidomimetics usually involves covalent attachment 

15 of one or more labels, directly or through a spacer (e.g., an amide group), 
to non-interfering position(s) on the peptidomimetic that are predicted by 
quantitative structure-activity data and/or molecular modeling. Such 
non-interfering positions generally are positions that do not form direct 
contacts with the macromolecules(s) to which the peptidomimetic binds 

20 to produce the therapeutic effect. Derivatization (e.g., labeling) of 
peptidomimetics should not substantially interfere with the desired 
biological or pharmacological activity of the peptidomimetic, 

6. Methods of preparing peptides and peptide mimetics 
Peptides that bind to MTSP proteins can be prepared by classical 

25 methods known in the art, for example, by using standard solid phase 
techniques. The standard methods include exclusive solid phase 
synthesis, partial solid phase synthesis methods, fragment condensation, 
classical solution synthesis, and even by recombinant DNA technology 
(see, e.g., Merrifield (1963) J. Am. Chem. Soc„ 55:2149, incorporated 

30 herein by reference.) 
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Using the "encoded synthetic library" or "very large scale 
immobilized polymer synthesis" systems (see/e.g., U.S. Patent No. 
5,925,525, and 5,902,723); one can not only determine the minimum 
size of a peptide with the activity of interest, one can also make all of the 
5 peptides that form the group of peptides that differ from the preferred 
motif (or the minimum size of that motif) in one, two, or more residues. 
This collection of peptides can then be screened for ability to bind to the 
target molecule, e.g., and MTSP protein or, preferably, the protease 
domain of an MTSP protein. This immobilized polymer synthesis system 

10 or other peptide synthesis methods can also be used to synthesize 

truncation analogs and deletion analogs and combinations of truncation 
and deletion analogs of the peptide compounds. 

These procedures can also be used to synthesize peptides in which 
amino acids other than the 20 naturally occurring, genetically encoded 

15 amino acids are substituted at one, two, or more positions of the peptide. 
For instance, naphthylalanine can be substituted for tryptophan, 
facilitating synthesis. Other synthetic amino acids that can be substituted 
into the peptides include L-hydroxypropyl, L-3, 4-dihydroxy-phenylalanyl, 
d amino acids such as L-d-hydroxylysyl and D-d-methylalanyl, 

20 L-a-methylalanyl, yff amino acids, and isoquinolyl. D amino acids and 
non-naturally occurring synthetic amino acids can also be incorporated 
into the peptides (see, e.g., Roberts et aL (1983) Unusual Amino/Acids in 
Peptide Synthesis, 5(6):341-449). 

The peptides may also be modified by phosphorylation (see, e.g., 

25 W. Bannwarth et al. (1 996) Biorganic and l\/ledicinal Chemistry Letters, 
ff(17):2141-2146), and other methods for making peptide derivatives 
(see, e.g., Hruby et al. (1990) Biochem. J., 268(2) :249-262). Thus, 
peptide compounds also serve as a basis to prepare peptide mimetics 
with similar biological activity. 

30 Those of skill in the art recognize that a variety of techniques are 

available for constructing peptide mimetics with the same or similar 
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desired biological activity as the corresponding peptide compound but 
with more favorable activity than the peptide with respect to solubility, 
stability, and susceptibility to hydrolysis and proteolysis {see, e.g,, 
Morgan et aL (1989) An, Rep. Med. Chem., 24:243-252). Methods for 
5 preparing peptide mimetics modified at the N-terminal amino group, the 
C-terminal carboxyl group, and/or changing one or more of the amido 
linkages in the peptide to a non-amido linkage are known to those of skill 
in the art. 

Amino terminus modifications include alkylating, acetylating, 

10 adding a carbobenzoyi group, forming a succinimide group, etc. (see, 
e,g.r Murray et aL (1 995) Burger's Medicinal Chemistry and Drug 
Discovery, 5t/i ed., VoL 7, Manfred E. Wolf, ed., John Wiley and Sons, 
Inc.). C-terminal modifications include mimetics wherein the C-terminal 
carboxyl group is replaced by an ester, an amide or modifications to form 

15 a cyclic peptide. 

In addition to N-terminal and C-terminal modifications, the peptide 
compounds, including peptide mimetics, can advantageously be modified 
with or covalently coupled to one or more of a variety of hydrophilic 
polymers. It has been found that when peptide compounds are 

20 derivatized with a hydrophilic polymer, their solubility and circulation 
half-lives may be increased and their immunogenicity is masked, with 
little, if any, diminishment in their binding activity. Suitable 
nonproteinaceous polymers include, but are not limited to, polyalkylethers 
as exemplified by polyethylene glycol and polypropylene glycol, polylactic 

25 acid, polyglycolic acid, polyoxyalkenes, polyvinylalcohol, 

polyvinylpyrrolidone, cellulose and cellulose derivatives, dextran and 
dextran derivatives, etc. Generally, such hydrophilic polymers have an 
average molecular weight ranging from about 500 to about 100,000 
daltons, more preferably from about 2,000 to about 40,000 daltons and, 

30 even more preferably, from about 5,000 to about 20,000 daltons. The 
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hydrophilic polymers also can have an average molecular weights of 
about 5,000 daltons, 10,000 daltons and 20,000 daltons. 

Methods for derivatizing peptide compounds or for coupling 
peptides to such polymers have been described (see, e.g., Zailipsky 
5 (1995) Bioconjugate Chem., 6:150-165; Monfardini et al. (1995) 

Bioconjugate Chem., 6:62-69; U.S. Pat. No. 4,640,835; U.S. Pat. No. 
4,496,689; U.S. Pat. No. 4,301,144; U.S. Pat. No. 4,670,417; U.S. Pat. 
No. 4,791,192; U.S. Pat. No. 4,179,337 and WO 95/34326, all of which 
are incorporated by reference in their entirety herein). 

10 Other methods for making peptide derivatives are described, for 

example, in Hruby et al. (1990), BiochemJ., 265(2):249-262, which is 
incorporated herein by reference. Thus, the peptide compounds also 
serve as structural models for non-peptidic compounds with similar 
biological activity. Those of skill in the art recognize that a variety of 

15 techniques are available for constructing compounds with the same or 
similar desired biological activity as a particular peptide compound but 
with more favorable activity with respect to solubility, stability, and 
susceptibility to hydrolysis and proteolysis (see, e.g., Morgan et al. 
(1989) An. Rep. Med. Chem., 24:243-252, incorporated herein by 

20 reference). These techniques include replacing the peptide backbone with 
a backbone composed of phosphonates, amidates, carbamates, 
sulfonamides, secondary amines, and N-methylamino acids. 

Peptide compounds may exist in a cyclized form with an 
intramolecular disulfide bond between the thiol groups of the cysteines. 

25 Alternatively, an intermolecular disulfide bond between the thiol groups of 
the cysteines can be produced to yield a dimeric (or higher oligomeric) 
compound. One or more of the cysteine residues may also be substituted 
with a homocysteine. 
I. CONJUGATES 

30 A conjugate, containing: a) a single chain protease domain (or 

proteolytically active portion thereof) of an MTSP protein or an MTSP3, 
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MTSP4 or MTSP6 full length zymogen, activated form thereof, or double 
or single chain protease domain thereof; and b) a targeting agent linked to 
the MTSP protein directly or via a linker, wherein the agent facilitates: i) 
affinity isolation or purification of the conjugate; ii) attachment of the 
5 conjugate to a surface; iii) detection of the conjugate; or iv) targeted 
delivery to a selected tissue or cell, is provided herein. The conjugate can 
be a chemical conjugate or a fusion protein mixture thereof. 

The targeting agent is preferably a protein or peptide fragment, 
such as a tissue specific or tumor specific monoclonal antibody or 

10 growth factor or fragment thereof linked either directly or via a linker to 
an MTSP protein or a protease domain thereof. The targeting agent may 
also be a protein or peptide fragment that contains a protein binding 
sequence, a nucleic acid binding sequence, a lipid binding sequence, a 
polysaccharide binding sequence, or a metal binding sequence, or a linker 

15 for attachment to a solid support. In a particular embodiment, the 

conjugate contains a) the MTSP or portion thereof, as described herein; 
and b) a targeting agent linked to the MTSP protein directly or via a linker. 

Conjugates, such as fusion proteins and chemical conjugates, of 
the MTSP protein with a protein or peptide fragment (or plurality thereof) 

20 that functions, for example, to facilitate affinity isolation or purification of 
the MTSP protein domain, attachment of the MTSP protein domain to a 
surface, or detection of the MTSP protein domain are provided. The 
conjugates can be produced by chemical conjugation, such as via thiol 
linkages, but are preferably produced by recombinant means as fusion 

25 proteins. In the fusion protein, the peptide or fragment thereof is linked 
to either the N-terminus or C-terminus of the MTSP protein domain. In 
chemical conjugates the peptide or fragment thereof may be linked 
anywhere that conjugation can be effected, and there may be a plurality 
of such peptides or fragments linked to a single MTSP protein domain or 

30 to a plurality thereof. 
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The targeting agent is preferably for in vitro delivery to a cell or 
tissue, and includes agents such as cell or tissue-specific antibodies, 
growth factors and other factors expressed on specific cells; and other 
cell or tissue specific agents the promote directed delivery of a linked 
protein. 

Most preferably the targeting agent specifically delivers the MTSP 
protein to selected cells by interaction with a ceil surface protein and 
internalization of conjugate or MTSP protein portion thereof. These 
conjugate are used in a variety of methods and are particularly suited for 
use in methods of activation of prodrugs, such as prodrugs that upon 
cleavage by the particular MTSP protein are cytotoxic. The prodrugs are 
administered prior to simultaneously with or subsequently to the 
conjugate. Upon delivery to the targeted cells, the protease activates the 
prodrug, which then exhibits is therapeutic effect, such as a cytotoxic 
effect. 

1 , Conjugation 

Conjugates with linked MTSP protein domains can be prepared 
either by chemical conjugation, recombinant DNA technology, or 
combinations of recombinant expression and chemical conjugation. The 
MTSP protein domains and the targeting agent may be linked in any 
orientation and more than one targeting agents and/or MTSP protein 
domains may be present in a conjugate. 

a. Fusion proteins 

Fusion proteins are proved herein. A fusion protein contains: a) 
one or a plurality of domains of an MTSP proteins and b) a targeting 
agent. The fusion proteins are preferably produced by recombinant 
expression of nucleic acids that encode the fusion protein. 
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b. Chemical conjugation 

To effect chemical conjugation herein, the MTSP protein domain is 
linked via one or more selected linkers or directly to the targeting agent. 
Chemical conjugation must be used if the targeted agent is other than a 
5 peptide or protein, such a nucleic acid or a non-peptide drug. Any means 
known to those of skill in the art for chemically conjugating selected 
moieties may be used. 

2. Linkers 

Linkers for two purposes are contemplated herein. The conjugates 
10 may include one or more linkers between the MTSP protein portion and 
the targeting agent. Additionally, linkers are used for facilitating or 
enhancing immobilization of an MTSP protein or portion thereof on a solid 
support, such as a microtiter plate, silicon or silicon-coated chip, glass or 
plastic support, such as for high throughput solid phase screening 
15 protocols. 

Any linker known to those of skill in the art for preparation of 
conjugates may be used herein. These linkers are typically used in the 
preparation of chemical conjugates; peptide linkers may be incorporated 
into fusion proteins. 

20 Linkers can be any moiety suitable to associate a domain of MTSP 

protein and a targeting agent. Such linkers and linkages include, but are 
not limited to, peptidic linkages, amino acid and peptide linkages, typically 
containing between one and about 60 amino acids, more generally 
between about 10 and 30 amino acids, chemical linkers, such as 

25 heterobifunctional cleavable cross-linkers, including but are not limited to^ 
N-succinimidyl {4-iodoacetyl)-aminobenzoate, sulfosuccinimydil {4- 
iodoacetyO-aminobenzoate, 4-succinimidyl-oxycarbonyl-a- (2- 
pyridyldithio)toluene, sulfosuccinimidyl-6- [a-methyl-a-(pyridyldithiol)- 
toluamido] hexanoate, N-succinimidyl-3-(-2-pyridyldithio) - proprionate, 

30 succinimidyl 6[3{-(-2-pyridyldithio)-proprionamido] hexanoate, 

sulfosuccinimidyl 6[3(-(-2-pyridyldithio)-propionamido] hexanoate, 3-(2- 
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pyridyldithio)-propionyl hydrazide, Ellman's reagent, dichlorotriazinic acid, 
and S-(2-thiopyridyl)-L-cysteine. Other linkers include, but are not limited 
to peptides and other moieties that reduce stearic hindrance between the 
domain of MTSP protein and the targeting agent, intracellular enzyme 
5 substrates, linkers that increase the flexibility of the conjugate, linkers 
that increase the solubility of the conjugate, linkers that increase the 
serum stability of the conjugate, photocleavable linkers and acid cleavable 
linkers. 



10 chemically linked conjugates include, but are not limited to, disulfide 
bonds, thioether bonds, hindered disulfide bonds, and covalent bonds 
between free reactive groups, such as amine and thiol groups. These 
bonds are produced using heterobifunctional reagents to produce reactive 
thiol groups on one or both of the polypeptides and then reacting the thiol 

15 groups on one polypeptide with reactive thiol groups or amine groups to 
which reactive maleimido groups or thiol groups can be attached on the 
other. Other linkers include, acid cleavable linkers, such as 
bismaleimideothoxy propane, acid labile-transferrin conjugates and adipic 
acid diihydrazide, that would be cleaved in more acidic intracellular 



20 compartments; cross linkers that are cleaved upon exposure to UV or 
visible light and linkers, such as the various domains, such as 0^1, Ch2, 
and Ch3, from the constant region of human IgG^ (see, Batra et al. 
Molecular Immunol., 30:379-386 (1993)). In some embodiments, several 
linkers may be included in order to take advantage of desired properties of 

25 each linker. 

Chemical linkers and peptide linkers may be inserted by covalently 
coupling the linker to the domain of MTSP protein and the targeting 
agent. The heterobifunctional agents, described below, may be used to 
effect such covalent coupling. Peptide linkers may also be linked by 
30 expressing DNA encoding the linker and TA, linker and targeted agent, or 
linker, targeted agent and TA as a fusion protein. Flexible linkers and 



Other exemplary linkers and linkages that are suitable for 
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linkers that increase solubility of the conjugates are contemplated for use, 

either alone or with other linkers are also contemplated herein. 

a) Acid cleavable, photocleavabie and heat sensitive 
linkers 

5 Acid cleavable linkers, photocleavabie and heat sensitive linkers 

may also be used, particularly where it may be necessary to cleave the 
domain of MTSP protein to permit it to be more readily accessible to 
reaction. Acid cleavable linkers include, but are not limited to, bismaleimi- 
deothoxy propane; and adipic acid dihydrazide linkers (see, e.g,, Fattom 

10 et al, (1992) Infection & Immun. 50:584-589) and acid labile transferrin 
conjugates that contain a sufficient portion of transferrin to permit entry 
into the intracellular transferrin cycling pathway (see, e.g., Weihoner et al. 
(1991) J. Biol. Chem. 265:4309-4314). 

Photocleavabie linkers are linkers that are cleaved upon exposure to 

15 light (see, e.g., Goidmacher et aL (1992) Bioconj. Chem. 5:104-107, 

which linkers are herein incorporated by reference), thereby releasing the 
targeted agent upon exposure to light. Photocleavabie linkers that are 
cleaved upon exposure to light are known (see, e.g., Hazum et al. (1981) 
in Pept., Proc. Eur. Pept. Symp,, 16th, Brunfeldt, K (Ed), pp. 105-110, 

20 which describes the use of a nitrobenzyl group as a photocleavabie 

protective group for cysteine; Yen et a/. (1989) Makromol. Chem 790:69- 
32, which describes water soluble photocleavabie copolymers, including 
hydroxypropylmethacrylamide copolymer, glycine copolymer, fluorescein 
copolymer and methylrhodamine copolymer; Goidmacher et al. (1992) 

25 Bioconj. Chem. 5:104-107, which describes a cross-linker and reagent 
that undergoes photolytic degradation upon exposure to near UV light 
(350 nm); and Senter et al. (1985) Photochem. Photobiol 42\23^'221 , 
which describes nitrobenzyloxycarbonyl chloride cross linking reagents 
that produce photocleavabie linkages), thereby releasing the targeted 

30 agent upon exposure to light. Such linkers would have particular use in 
treating dermatological or ophthalmic conditions that can be exposed to 
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light using fiber optics. After administration of the conjugate, the eye or 
skin or other body part can be exposed to light, resulting in release of the 
targeted nnoiety from the conjugate. Such photocleavable linkers are 
useful in connection with diagnostic protocols in which it is desirable to 
5 remove the targeting agent to permit rapid clearance from the body of the 
animal. 

b) Other linkers for chemical conjugation 

Other linkers, include trityl linkers, particularly, derivatized 
trityl groups to generate a genus of conjugates that provide for 

10 release of therapeutic agents at various degrees of acidity or alkalinity. 
The flexibility thus afforded by the ability to preselect the pH range at 
which the therapeutic agent will be released allows selection of a linker 
based on the known physiological differences between tissues in need of 
delivery of a therapeutic agent (see, e.g.^ U.S. Patent No. 5,612,474). 

15 For example, the acidity of tumor tissues appears to be lower than that of 
normal tissues. 

c) Peptide linkers 

The linker moieties can be peptides. Peptide linkers can be 
employed in fusion proteins and also in chemically linked conjugates. The 

20 peptide typically has from about 2 to about 60 amino acid residues, for 
example from about 5 to about 40, or from about 10 to about 30 amino 
acid residues. The length selected will depend upon factors, such as the 
use for which the linker is included. 

Peptide linkers are advantageous when the targeting agent is 

25 proteinaceous. For example, the linker moiety can be a flexible spacer 
amino acid sequence, such as those known in single-chain antibody 
research. Examples of such known linker moieties include, but are not 
limited to, peptides, such as {Gly^Ser)^ and (Ser^Gly)^, in which n is 1 to 
6, preferably 1 to 4, more preferably 2 to 4, and m is 1 to 6, preferably 1 

30 to 4, more preferably 2 to 4, enzyme cleavable linkers and others. 
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Additional linking moieties are described, for example, in Huston et 
aL, Proc. Natl. Acad. Sci. U.S.A. S5:5879-5883, 1988; Whitlow, M., et 
a/., Protein Engineering 5:989-995, 1993; Newton eta/., Biocfiemistry 
35:545-553, 1996; A. J. Cumber eta/., Bioconj. Chem. 5:397-401, 
5 1992; Ladurner et a!., J. IVIol. Biol. 273:330-337, 1997; and U.S. Patent. 
No. 4,894,443. In some embodiments, several linkers may be included in 
order to take advantage of desired properties of each linker. 

3. Targeting agents 

Any agent that facilitates detection, immobilization, or purification 
10 of the conjugate is contemplated for use herein. For chemical conjugates 
any moiety that has such properties is contemplated; for fusion proteins, 
the targeting agent is a protein, peptide or fragment thereof that sufficient 
to effects the targeting activity. Preferred targeting agents are those that 
deliver the MTSP protein or portion thereof to selected cells and tissues. 
15 Such agents include tumor specific monoclonal antibodies and portions 
thereof, growth factors, such as FGF, EGF, PDGF, VEGF, cytokines, 
including chemokines, and other such agents. 

4. Nucleic acids, plasmids and cells 

Isolated nucleic acid fragments encoding fusion proteins are 
20 provided. The nucleic acid fragment that encodes the fusion protein 

includes: a) nucleic acid encoding a protease domain of an MTSP protein 
encoded by a nucleic acid that hybridizes to a nucleic acid having the 
nucleotide sequence set forth in the SEQ. ID N0:1; and b) nucleic acid 
encoding a protein, peptide or effective fragment thereof that facilitates: i) 
25 affinity isolation or purification of the fusion protein; ii) attachment of the 
fusion protein to a surface; or iii) detection of the fusion protein. 
Preferably, the nucleic acid is DNA.. 

Plasmids for replication and vectors for expression that contain the 
above nucleic acid fragments are also provided. Cells containing the 
30 plasmids and vectors are also provided. The cells can be any suitable 
host including, but are not limited to, bacterial cells, yeast cells, fungal 
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cells, plant cells, insect cell and animal cells. The nucleic acids, plasmids, 
and cells containing the plasmids can be prepared according to methods 
known in the art including any described herein. 

Also provided are methods for producing the above fusion proteins. 
5 An exemplary method includes the steps of growing, i.e. culturing the 
cells so that the proliferate, cells containing a plasmid encoding the fusion 
protein under conditions whereby the fusion protein is expressed by the 
cell, and recovering the expressed fusion protein. Methods for expressing 
and recovering recombinant proteins are well known in the art {See 

10 generally. Current Protocols in Molecular Biology (1998) § 16, John Wiley 
& Sons, Inc.) and such methods can be used for expressing and 
recovering the expressed fusion proteins. Preferably, the recombinant 
expression and recovery methods disclosed in Section B can be used. 
The recovered fusion proteins can be isolated or purified by 

15 methods known in the art such as centrifugation, filtration, 

chromatograph, electrophoresis, immunoprecipitation, etc., or by a 
combination thereof [See generally^ Current Protocols in Molecular Biology 
(1998) § 10, John Wiley & Sons, Inc.). Preferably, the recovered fusion 
protein is isolated or purified through affinity binding between the protein 

20 or peptide fragment of the fusion protein and an affinity binding moiety. 
As discussed in the above sections regarding the construction of the 
fusion proteins, any affinity binding pairs can be constructed and used in 
the isolation or purification of the fusion proteins. For example, the 
affinity binding pairs can be protein binding sequences/protein, DNA 

25 binding sequences/DNA sequences, RNA binding sequences/RNA 
sequences, lipid binding sequences/lipid, polysaccharide binding 
sequences/polysaccharide, or metal binding sequences/metal. 
5. Immobilization and supports or substrates therefor 
In certain embodiments, where the targeting agents are designed 

30 for linkage to surfaces, the MTSP protein can be attached by linkage such 
as ionic or covalent, non-covalent or other chemical interaction, to a 
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surface of a support or matrix material. Immobilization may be effected 
directly or via a linker. The MTSP protein may be immobilized on any 
suitable support, including, but are not limited to, silicon chips, and other 
supports described herein and known to those of skill in the art. A 
5 plurality of MTSP protein or protease domains thereof may be attached to 
a support, such as an array {Ae., a pattern of two or more) of conjugates 
on the surface of a silicon chip or other chip for use in high throughput 
protocols and formats. 

It is also noted that the domains of the MTSP protein can be linked 
10 directly to the surface or via a linker without a targeting agent linked 
thereto. Hence chips containing arrays of the domains of the MTSP 
protein. 

The matrix material or solid supports contemplated herein are 
generally any of the insoluble materials known to those of skill in the art 

15 to immobilize ligands and other molecules, and are those that used in 

many chemical syntheses and separations. Such supports are used, for 
example, in affinity chromatography, in the immobilization of biologically 
active materials, and during chemical syntheses of biomolecuies, including 
proteins, amino acids and other organic molecules and polymers. The 

20 preparation of and use of supports is well known to those of skill in this 
art; there are many such materials and preparations thereof known. For 
example, naturally-occurring support materials, such as agarose and 
cellulose, may be isolated from their respective sources, and processed 
according to known protocols, and synthetic materials may be prepared in 

25 accord with known protocols. 

The supports are typically insoluble materials that are solid, porous, 
deformable, or hard, and have any required structure and geometry, inclu- 
ding, but not limited to: beads, pellets, disks, capillaries, hollow fibers, 
needles, solid fibers, random shapes, thin films and membranes. Thus, 

30 the item may be fabricated from the matrix material or combined with it, 
such as by coating all or part of the surface or impregnating particles. 
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Typically, when the matrix is particulate, the particles are at least 
about 10-2000 fjM, but may be smaller or larger, depending upon the 
selected application. Selection of the matrices will be governed, at least 
in part, by their physical and chemical properties, such as solubility, 
5 functional groups, mechanical stability, surface area swelling propensity, 
hydrophobic or hydrophilic properties and intended use. 

If necessary, the support matrix material can be treated to contain 
an appropriate reactive moiety. In some cases, the support matrix 
material already containing the reactive moiety may be obtained 

10 commercially. The support matrix material containing the reactive moiety 
may thereby serve as the matrix support upon which molecules are 
linked. Materials containing reactive surface moieties such as amino 
silane linkages, hydroxyl linkages or carboxysilane linkages may be 
produced by well established surface chemistry techniques involving 

15 silanization reactions, or the like. Examples of these materials are those 
having surface silicon oxide moieties, covalently linked to gamma-amino- 
propylsilane, and other organic moieties; N-[3-(triethyoxysilyl)propyl]- 
phthelamic acid; and bis-{2-hydroxyethyl)aminopropyltriethoxysilane. 
Exemplary of readily available materials containing amino group reactive 

20 functionalities, include, but are not limited to, para-aminophenyl- 

triethyoxysilane. Also derivatized polystyrenes and other such polymers 
are well known and readily available to those of skill in this art (e.g.r the 
Tentagel® Resins are available with a multitude of functional groups, and 
are sold by Rapp Polymere, Tubingen, Germany; see, U.S. Patent No. 

25 4,908,405 and U.S. Patent No. 5,292,814; see, also Butz et al.. Peptide 
Res., 7:20-23 (1994); and Kleine et al., ImmunobtoL, 190:53-66 (1994)). 

These matrix materials include any material that can act as a 
support matrix for attachment of the molecules of interest. Such 
materials are known to those of skill in this art, and include those that are 

30 used as a support matrix. These materials include, but are not limited to, 
inorganics, natural polymers, and synthetic polymers, including, but are 
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not limited to: cellulose, cellulose derivatives, acrylic resins, glass, silica 
gels, polystyrene, gelatin, polyvinyl pyrrolidone, co-polymers of vinyl and 
acrylamide, polystyrene cross-linked with divinylbenzene and others {see, 
Merrifield, Biochemistry ^ 3:1385-1390 (1964)), polyacrylamides, latex 
5 gels, polystyrene, dextran, poiyacrylamides, rubber, silicon, plastics, 
nitrocellulose, celluloses, natural sponges. Of particular interest herein, 
are highly porous glasses {see, e.g., U.S. Patent No. 4,244,721) and 
others prepared by mixing a borosilicate, alcohol and water. 

Synthetic supports include, but are not limited to: acrylamides, 

10 dextran-derivatives and dextran co-polymers, agarose-polyacrylamide 
blends, other polymers and co-polymers with various functional groups, 
methacrylate derivatives and co-polymers, polystyrene and polystyrene 
copolymers [see, e.g., Merrifield, Biochemistry, 3:1385-1390 (1964); 
Berg et al., in Innovation Perspect. Solid Phase Synth. Collect. Pap., Int. 

15 Symp., 1st, Epton, Roger (Ed), pp. 453-459 (1990); Berg et al., Pept, 
Proc, Eur. Pept. Symp., 20th, Jung, G. et (Eds), pp. 196-198 (1989); 
Berg et al., J, Am. Chem. Soc, 1 1 1 :8024-8026 (1989); Kent et al., Isr. 
J. Chem., 17:243-247 (1979); Kent et al., J. Org. Chem., 43:2845-2852 
(1978); Mitchell et al.. Tetrahedron Lett., 42:3795-3798 (1976); U.S. 

20 Patent No. 4,507,230; U.S. Patent No. 4,006,1 17; and U.S. Patent No. 
5,389,449). Such materials include those made from polymers and co- 
polymers such as polyvinylalcohols, acrylates and acrylic acids such as 
polyethylene-co-acrylic acid, polyethylene-co-methacrylic acid, polyethy- 
lene-co-ethylacrylate, polyethylene-co-methyl acrylate, polypropylene-co- 

25 acrylic acid, polypropylene-co-methyl-acrylic acid, polypropylene-co-ethyl- 
acrylate, polypropyiene-co-methyl acrylate, polyethylene-co-vinyl acetate, 
polypropylene-co-vinyl acetate, and those containing acid anhydride 
groups such as polyethylene-co-maleic anhydride and polypropytene-co- 
maleic anhydride. Liposomes have also been used as solid supports for 

30 affinity purifications (Powell et al. Biotechnol. Bioeng., 33:173 (1989)). 
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Numerous methods have been developed for the immobilization of 
proteins and other biomolecules onto solid or liquid supports {see, e,g., 
Mosbach, Methods in Enzymology, 44 (1976); Weetall, Immobilized 
Enzymes, Antigens, Antibodies, and Peptides ^ (1975); Kennedy et al., 
5 Solid Phase Biochemistry, Analytical and Synthetic Aspects, Scouten, ed., 
pp. 253-391 (1983); see, generally. Affinity Techniques. Enzyme 
Purification: Part B. Methods in Enzymology, Vol. 34, ed. W. B. Jakoby, 
M. Wilchek, Acad. Press, N.Y. (1974); and Immobilized Biochemicals and 
Affinity Chromatography, Advances in Experimental Medicine and 

10 Biology, vol. 42, ed. R. Dunlap, Plenum Press, N.Y. (1974)). 

Among the most commonly used methods are absorption and ad- 
sorption or covalent binding to the support, either directly or via a linker, 
such as the numerous disulfide linkages, thioether bonds, hindered 
disulfide bonds, and covalent bonds between free reactive groups, such 

15 as amine and thiol groups, known to those of skill in art (see, e.g., the 
PIERCE CATALOG, ImmunoTechnology Catalog & Handbook, 1992-1993, 
which describes the preparation of and use of such reagents and provides 
a commercial source for such reagents; Wong, Chemistry of Protein 
Conjugation and Cross Linking, CRC Press (1993); see also DeWitt et al., 

20 Proc. NatL Acad. Sci. U.S.A., 90:6909 (1993); Zuckermann et a!., J. Am. 
Chem. Soc, 1 14 :10646 (1992); Kurth et al., J. Am. Chem. Soc, 
1 16 :2661 (1994); Ellman et al., Proc. Natl. Acad. Sci. U.S. A,, 91:4708 
(1994).; Sucholeiki, Tetrahedron Lttrs., 35:7307 (1994); Su-Sun Wang, J. 
Org. Chem., 41:3258 (1976); Padwa et al., J. Org. Chem., 41=3550 

25 (1971); and Vedejs et al., J. Org. Chem., 49:575 (1984), which describe 
photosensitive linkers). 

To effect immobilization, a composition containing the protein or 
other biomolecule is contacted with a support material such as alumina, 
carbon, an ion-exchange resin, cellulose, glass or a ceramic. 

30 Fluorocarbon polymers have been used as supports to which biomolecules 
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have been attached by adsorption (see, U.S. Patent No. 3,843,443; 
Published International PCT Application WO/86 03840). 
J. Prognosis and diagnosis 

MTSP protein proteins, domains, analogs, and derivatives thereof, 
5 and encoding nucleic acids (and sequences conaplementary thereto), and 
anti-MTSP protein antibodies, can be used in diagnostics. Such 
molecules can be used in assays, such as immunoassays, to detect, 
prognose, diagnose, or monitor various conditions, diseases, and 
disorders affecting MTSP protein expression, or monitor the treatment 

10 thereof. For purposes herein, the presence of MTSPs in body fluids or 
tumor tissues are of particular interest. 

In particular, such an immunoassay is carried out by a method 
including contacting a sample derived from a patient with an anti-MTSP 
protein antibody under conditions such that specific binding can occur, 

15 and detecting or measuring the amount of any specific binding by the 
antibody. In a specific aspect, such binding of antibody, in tissue 
sections, can be used to detect aberrant MTSP protein localization or 
aberrant (e.g., low or absent) levels of MTSP protein. In a specific 
embodiment, antibody to MTSP protein can be used to assay in a patient 

20 tissue or serum sample for the presence of MTSP protein where an 

aberrant level of MTSP protein is an indication of a diseased condition. 

The immunoassays which can be used include but are not limited 
to competitive and non-competitive assay systems using techniques such 
as western blots, radioimmunoassays, ELISA (enzyme linked 

25 immunosorbent assay), "sandwich" immunoassays, immunoprecipitation 
assays, precipitin reactions, gel diffusion precipitin reactions, 
immunodiffusion assays, agglutination assays, complement-fixation 
assays, immunoradiometric assays, fluorescent immunoassays, protein A 
immunoassays, to name but a few. 

30 MTSP protein genes and related nucleic acid sequences and 

subsequences, including complementary sequences, can also be used in 



-136- 




hybridization assays. MTSP protein nucleic acid sequences, or 
subsequences thereof containing about at least 8 nucleotides, preferably 
14 or 1 6 or 30 or more continugous nucleotides, can be used as 
hybridization probes. Hybridization assays can be used to detect, 
5 prognose, diagnose, or monitor conditions, disorders, or disease states 
associated with aberrant changes in MTSP protein expression and/or 
activity as described herein. In particular, such a hybridization assay is 
carried out by a method by contacting a sample containing nucleic acid 
with a nucleic acid probe capable of hybridizing to MTSP protein encoding 

10 DNA or RNA, under conditions such that hybridization can occur, and 
detecting or measuring any resulting hybridization. 

In a specific embodiment, a method of diagnosing a disease or 
disorder characterized by detecting an aberrant level of an MTSP protein 
in a subject is provided herein by measuring the level of the DNA, RNA, 

15 protein or functional activity of the epithelial MTSP protein at least 
partially encoded by a nucleic acid that hybridizes to a nucleic acid 
having the nucleotide sequence set forth in the SEQ. ID N0:1 in a sample 
derived from the subject, wherein an increase or decrease in the level of 
the DNA, RNA, protein or functional activity of the MTSP protein, relative 

20 to the level of the DNA, RNA, protein or functional activity found in an 
analogous sample not having the disease or disorder indicates the 
presence of the disease or disorder in the subject. 

Kits for diagnostic use are also provided, that contain in one or 
more containers an anti-MTSP protein antibody, particularly anti-MTSP3 

25 or anti = MTSP4, and, optionally, a labeled binding partner to the 

antibody. Alternatively, the anti-MTSP protein antibody can be labeled 
(with a detectable marker, e,g,, a chemiluminescent, enzymatic, 
fluorescent, or radioactive moiety). A kit is also provided that includes in 
one or more containers a nucleic acid probe capable of hybridizing to 

30 MTSP protein-encoding RNA. In a specific embodiment, a kit can 

comprise in one or more containers a pair of primers {e.g., each in the 
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size range of 6-30 nucleotides) that are capable of priming amplification 

[e.g., by polymerase chain reaction (see e.g., Innis et al., 1990, PCR 

Protocols, Academic Press, Inc., San Diego, CA), ligase chain reaction 

(see EP 320,308) use of (1/3 replicase, cyclic probe reaction, or other 

5 methods known in the art under appropriate reaction conditions of at least 

a portion of an MTSP protein-encoding nucleic acid. A kit can optionally 

further comprise in a container a predetermined amount of a purified 

MTSP protein or nucleic acid, e.g.r for use as a standard or control. 

K. PHARMACEUTICAL COMPOSITIONS AND MODES OF 
10 ADMINISTRATION 

1 . Components of the compositions 

Pharmaceutical compositions containing the identified compounds 
that modulate the activity of an MTSP protein are provided herein. Also 
provided are combinations of a compound that modulates the activity of 

15 an MTSP protein and another treatment or compound for treatment of a 
neoplastic disorder, such as a chemotherapeutic compound. 

The MTSP protein modulator and the anti-tumor agent can be 
packaged as separate compositions for administration together or 
sequentially or intermittently. Alternatively, they can provided as 

20 a single composition for administration or as two compositions for 
administration as a single composition. The combinations can be 
packaged as kits. 

a. MTSP protein inhibitors 
Any MTSP protein inhibitors, including those described herein when 

25 used alone or in combination with other compounds, that can alleviate, 

reduce, ameliorate, prevent, or place or maintain in a state of remission of 
clinical symptoms or diagnostic markers associated with neoplastic 
diseases, including undesired and/or uncontrolled angiogenesis, can be 
used in the present combinations. 

30 In one embodiment, the MTSP protein inhibitor is an antibody or 

fragment thereof that specifically reacts with an MTSP protein or the 
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protease domain thereof, an inhibitor of the MTSP protein production, an 
inhibitor of the epithelial MTSP protein membrane-localization, or any 
inhibitor of the expression of or, especially, the activity of an MTSP 
protein. 

5 b. Anti-angiogenic agents and anti-tumor agents 

Any anti-angiogenic agents and anti-tumor agents, including those 
described herein, when used alone or in combination with other 
compounds, that can alleviate, reduce, ameliorate, prevent, or place or 
maintain in a state of remission of clinical symptoms or diagnostic 
10 markers associated with undesired and/or uncontrolled angiogenesis 
and/or tumor growth and metastasis, particularly solid neoplasms, 
"t: vascular malformations and cardiovascular disorders, chronic 

H inflammatory diseases and aberrant wound repairs, circulatory disorders, 

crest syndromes, dermatological disorders, or ocular disorders, can be 
\^ 15 used in the combinations. Also contemplated are anti-tumor agents for 

M use in combination with an inhibitor of an MTSP protein. 

c. Anti-tumor agents and anti-angiogenic agents 
y The compounds identified by the methods provided herein or 

1J provided herein can be used in combination with anti-tumor agents and/or 

!; 20 anti-angiogenesis agents, 

2. Formulations and route of administration 
The compounds herein and agents are preferably formulated as 
pharmaceutical compositions, preferably for single dosage administration. 
The concentrations of the compounds in the formulations are effective for 
25 delivery of an amount, upon administration, that is effective for the 

intended treatment. Typically, the compositions are formulated for single 
dosage administration. To formulate a composition, the weight fraction 
of a compound or mixture thereof is dissolved, suspended, dispersed or 
otherwise mixed in a selected vehicle at an effective concentration such 
30 that the treated condition is relieved or ameliorated. Pharmaceutical 

carriers or vehicles suitable for administration of the compounds provided 
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herein include any such carriers known to those skilled in the art to be 
suitable for the particular mode of administration. 

In addition, the compounds may be formulated as the sole 
pharmaceuticaily active ingredient in the composition or may be combined 
5 with other active ingredients. Liposomal suspensions, including tissue- 
targeted liposomes, may also be suitable as pharmaceuticaily acceptable 
carriers. These may be prepared according to methods known to those 
skilled in the art. For example, liposome formulations may be prepared as 
described in U.S. Patent No. 4,522,811. 
10 The active compound is included in the pharmaceuticaily 

acceptable carrier in an amount sufficient to exert a therapeutically useful 
effect in the absence of undesirable side effects on the patient treated. 
The therapeutically effective concentration may be determined empirically 
by testing the compounds in known in vitro and in vivo systems, such as 

15 the assays provided herein. 

The concentration of active compound in the drug composition will 
depend on absorption, inactivation and excretion rates of the active 
compound, the physicochemical characteristics of the compound, the 
dosage schedule, and amount administered as well as other factors 

20 known to those of skill in the art. 

Typically a therapeutically effective dosage is contemplated. The 
amounts administered may be on the order of 0.001 to 1 mg/ml, 
preferably about 0.005-0.05 mg/ml, more preferably about 0.01 mg/ml, 
of blood volume. Pharmaceutical dosage unit forms are prepared to 

25 provide from about 1 mg to about 1000 mg and preferably from about 10 
to about 500 mg, more preferably about 25-75 mg of the essential active 
ingredient or a combination of essential ingredients per dosage unit form, 
jhe precise dosage can be empirically determined. 

The active ingredient may be administered at once, or may be 

30 divided into a number of smaller doses to be administered at intervals of 
time. It is understood that the precise dosage and duration of treatment 
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is a function of the disease being treated and may be determined 
empirically using known testing protocols or by extrapolation from in vivo 
or jn vitro test data. It is to be noted that concentrations and dosage 
values may also vary with the severity of the condition to be alleviated. It 
5 is to be further understood that for any particular subject, specific dosage 
regimens should be adjusted over time according to the individual need 
and the professional judgment of the person administering or supervising 
the administration of the compositions, and that the concentration ranges 
set forth herein are exemplary only and are not intended to limit the scope 

10 or use of the claimed compositions and combinations containing them. 

Preferred pharmaceutically acceptable derivatives include acids, 
salts, esters, hydrates, solvates and prodrug forms. The derivative is 
typically selected such that its pharmacokinetic properties are superior to 
the corresponding neutral compound. 

15 Thus, effective concentrations or amounts of one or more of the 

compounds provided herein or pharmaceutically acceptable derivatives 
thereof are mixed with a suitable pharmaceutical carrier or vehicle for 
systemic, topical or local administration to form pharmaceutical 
compositions. Compounds are included in an amount effective for 

20 ameliorating or treating the disorder for which treatment is contemplated. 
The concentration of active compound in the composition will depend on 
absorption, inactivation, excretion rates of the active compound, the 
dosage schedule, amount administered, particular formulation as well as 
other factors known to those of skill in the art. 

25 Solutions or suspensions used for parenteral, intradermal, 

subcutaneous, or topical application can include any of the following 
components: a sterile diluent, such as water for injection, saline solution, 
fixed oil, polyethylene glycol, glycerine, propylene glycol or other 
synthetic solvent; antimicrobial agents, such as benzyl alcohol and methyl 

30 parabens; antioxidants, such as ascorbic acid and sodium bisulfite; 
chelating agents, such as ethylenediaminetetraacetic acid (EDTA); 
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buffers, such as acetates, citrates and phosphates; and agents for the 
adjustment of tonicity such as sodium chloride or dextrose. Parenteral 
preparations can be enclosed in ampules, disposable syringes or single or 
multiple dose vials made of glass, plastic or other suitable material. 
5 In instances in which the compounds exhibit insufficient solubility, 

methods for solubilizing compounds may be used. Such methods are 
known to those of skill in this art, and include, but are not limited to, 
using cosolvents, such as dimethylsulfoxide (DMSO), using surfactants, 
such as Tween®, or dissolution in aqueous sodium bicarbonate. 

10 Derivatives of the compounds, such as prodrugs of the compounds may 
also be used in formulating effective pharmaceutical compositions. For 
ophthalmic indications, the compositions are formulated in an 
ophthalmically acceptable carrier. For the ophthalmic uses herein, local 
administration, either by topical administration or by injection is preferred. 

15 Time release formulations are also desirable. Typically, the compositions 
are formulated for single dosage administration, so that a single dose 
administers an effective amount. 

Upon mixing or addition of the compound with the vehicle, the 
resulting mixture may be a solution, suspension, emulsion or other 

20 composition. The form of the resulting mixture depends upon a number 
of factors, including the intended mode of administration and the 
solubility of the compound in the selected carrier or vehicle. If necessary, 
pharmaceutically acceptable salts or other derivatives of the compounds 
are prepared. 

25 The compound is included in the pharmaceutically acceptable 

carrier in an amount sufficient to exert a therapeutically useful effect in 
the absence of undesirable side effects on the patient treated. It is 
understood that number and degree of side effects depends upon the 
condition for which the compounds are administered. For example, 

30 certain toxic and undesirable side effects are tolerated when treating life- 
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threatening illnesses that would not be tolerated when treating disorders 

of lesser consequence. 

The compounds can also be mixed with other active materials, that 
do not impair the desired action, or with materials that supplement the 
5 desired action known to those of skill in the art. The formulations of the 
compounds and agents for use herein include those suitable for oral, 
rectal, topical, inhalational, buccal (e.g., sublingual), parenteral (e.g., 
subcutaneous, intramuscular, intradermal, or intravenous), transdermal 
administration or any route. The most suitable route in any given case will 

10 depend on the nature and severity of the condition being treated and on 
the nature of the particular active compound which is being used. The 
formulations are provided for administration to humans and animals in unit 
dosage forms, such as tablets, capsules, pills, powders, granules, sterile 
parenteral solutions or suspensions, and oral solutions or suspensions, 

15 and oil-water emulsions containing suitable quantities of the compounds 
or pharmaceutically acceptable derivatives thereof. The pharmaceutically 
therapeutically active compounds and derivatives thereof are typically 
formulated and administered in unit-dosage forms or multiple-dosage 
forms. Unit-dose forms as used herein refers to physically discrete units 

20 suitable for human and animal subjects and packaged individually as is 
known in the art. Each unit-dose contains a predetermined quantity of 
the therapeutically active compound sufficient to produce the desired 
therapeutic effect, in association with the required pharmaceutical carrier, 
vehicle or diluent. Examples of unit-dose forms include ampoules and 

25 syringes and individually packaged tablets or capsules. Unit-dose forms 
may be administered in fractions or multiples thereof. A multiple-dose 
form is a plurality of identical unit-dosage forms packaged in a single 
container to be administered in segregated unit-dose form. Examples of 
multiple-dose forms include vials, bottles of tablets or capsules or bottles 

30 of pints or gallons. Hence, multiple dose form is a multiple of unit-doses 
which are not segregated in packaging. 
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The composition can contain along with the active ingredient: a 
diluent such as lactose, sucrose, dicalcium phosphate, or carboxymethyi- 
celiulose; a lubricant, such as magnesium stearate, calcium stearate and 
talc; and a binder such as starch, natural gums, such as gum 
5 acaciagelatin, glucose, molasses, polvinylpyrrolidine, celluloses and 
derivatives thereof, povidone, crospovidones and other such binders 
known to those of skill in the art. Liquid pharmaceutically administrable 
compositions can, for example, be prepared by dissolving, dispersing, or 
otherwise mixing an active compound as defined above and optional 

10 pharmaceutical adjuvants in a carrier, such as, for example, water, saline, 
aqueous dextrose, glycerol, glycols, ethanol, and the like, to thereby form 
a solution or suspension. If desired, the pharmaceutical composition to 
be administered may also contain minor amounts of nontoxic auxiliary 
substances such as wetting agents, emulsifying agents, or solubilizing 

15 agents, pH buffering agents and the like, for example, acetate, sodium 
citrate, cyclodextrine derivatives, sorbitan monolaurate, triethanolamine 
sodium acetate, triethanolamine oleate, and other such agents. Methods 
of preparing such dosage forms are known, or will be apparent, to those 
skilled in this art (see, e.g., Remington's Pharmaceutical Sciences, Mack 

20 Publishing Company, Easton, Pa., 15th Edition, 1975). The composition 
or formulation to be administered will contain a quantity of the active 
compound in an amount sufficient to alleviate the symptoms of the 
treated subject. 

Dosage forms or compositions containing active ingredient in the 
25 range of 0.005% to 100% with the balance made up from non-toxic 
carrier may be prepared. For oral administration, the pharmaceutical 
compositions may take the form of, for example, tablets or capsules 
prepared by conventional means with pharmaceutically acceptable 
excipients such as binding agents (e,g,, pregelatinized maize starch, 
30 polyvinyl pyrrolidone or hydroxypropyl methylcellulose); fillers {e.g., 
lactose, microcrystalline cellulose or calcium hydrogen phosphate); 
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lubricants [e.g., magnesiunn stearate, talc or silica); disintegrants [e.g., 
potato starch or sodium starch glycolate); or wetting agents (e.g., sodium 
lauryl sulphate). The tablets may be coated by methods well-known in 
the art. 

5 The pharmaceutical preparation may also be in liquid form, for 

example, solutions, syrups or suspensions, or may be presented as a drug 
product for reconstitution with water or other suitable vehicle before use. 
Such liquid preparations may be prepared by conventional means with 
pharmaceutically acceptable additives such as suspending agents (e.g., 

10 sorbitol syrup, cellulose derivatives or hydrogenated edible fats); 

emulsifying agents (e.g., lecithin or acacia); non-aqueous vehicles (e.g., 
almond oil, oily esters, or fractionated vegetable oils); and preservatives 
(e.g., methyl or propyl-p-hydroxybenzoates or sorbic acid). 

Formulations suitable for rectal administration are preferably 

15 presented as unit dose suppositories. These may be prepared by 
admixing the active compound with one or more conventional solid 
carriers, for example, cocoa butter, and then shaping the resulting 
mixture. 

Formulations suitable for topical application to the skin or to the 
20 eye preferably take the form of an ointment, cream, lotion, paste, gel, 
spray, aerosol and oil. Carriers which may be used include vaseline, 
lanoline, polyethylene glycols, alcohols, and combinations of two or more 
thereof. The topical formulations may further advantageously contain 
0.05 to 15 percent by weight of thickeners selected from among 
25 hydroxypropyl methyl cellulose, methyl cellulose, polyvinylpyrrolidone, 
polyvinyl alcohol, poly (alkylene glycols), poly/hydroxyalkyi, 
(meth)acrylates or poly(meth)acrylamides. A topical formulation is often 
applied by instillation or as an ointment into the conjunctival sac. It can 
also be used for irrigation or lubrication of the eye, facial sinuses, and 
30 external auditory meatus. It may also be injected into the anterior eye 
chamber and other places. The topical formulations in the liquid state 
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may be also present in a hydrophilic three-dimensional polymer matrix in 
the form of a strip, contact lens, and the like from which the active 
components are released. 

For administration by inhalation, the compounds for use herein can 
5 be delivered in the form of an aerosol spray presentation from pressurized 
packs or a nebulizer, with the use of a suitable propellant, e.g., dichlorodi- 
fluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon 
dioxide or other suitable gas. In the case of a pressurized aerosol, the 
dosage unit may be determined by providing a valve to deliver a metered 

10 amount. Capsules and cartridges of, e.g., gelatin, for use in an inhaler or 
insufflator may be formulated containing a powder mix of the compound 
and a suitable powder base such as lactose or starch. 

Formulations suitable for buccal (sublingual) administration include, 
for example, lozenges containing the active compound in a flavored base, 

15 usually sucrose and acacia or tragacanth; and pastilles containing the 
compound in an inert base such as gelatin and glycerin or sucrose and 
acacia. 

The compounds may be formulated for parenteral administration by 
injection, e.g., by bolus injection or continuous infusion. Formulations for 

20 injection may be presented in unit dosage form, e.g., in ampules or in 
multi-dose containers, with an added preservative. The compositions 
may be suspensions, solutions or emulsions in oily or aqueous vehicles, 
and may contain formulatory agents such as suspending, stabilizing 
and/or dispersing agents. Alternatively, the active ingredient may be in 

25 powder form for reconstitution with a suitable vehicle, e.g., sterile 
pyrogen-free water or other solvents, before use. 

Formulations suitable for transdermal administration may be 
presented as discrete patches adapted to remain in intimate contact with 
the epidermis of the recipient for a prolonged period of time. Such 

30 patches suitably contain the active compound as an optionally buffered 
aqueous solution of, for example, 0.1 to 0.2 M concentration with 
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respect to the active compound. Formulations suitable for transdermal 
administration may also be delivered by iontophoresis (see, e.g., 
Pharmaceutical Research 3 (6), 318 (1986)) and typically take the form of 
an optionally buffered aqueous solution of the active compound. 



controlled release means and/or delivery devices (see, e.g., in U.S. Patent 
Nos. 3,536,809; 3,598,123; 3,630,200; 3,845,770; 3,847,770; 
3,916,899; 4,008,719; 4,687,610; 4,769,027; 5,059,595; 5,073,543; 
5,120,548; 5,354,566; 5,591,767; 5,639,476; 5,674,533 and 

10 5,733,566). 

Desirable blood levels may be maintained by a continuous infusion 
of the active agent as ascertained by plasma levels. It should be noted 
that the attending physician would know how to and when to terminate, 
interrupt or adjust therapy to lower dosage due to toxicity, or bone 

15 marrow, liver or kidney dysfunctions. Conversely, the attending physician 
would also know how to and when to adjust treatment to higher levels if 
the clinical response is not adequate (precluding toxic side effects). 

The efficacy and/or toxicity of the MTSP protein inhibitor(s), alone 
or in combination with other agents can also be assessed by the methods 

20 known in the art (See generally, O'Reilly, Investigational New Drugs, 
15:5-13 (1997)). 

The active compounds or pharmaceutically acceptable derivatives 
may be prepared with carriers that protect the compound against rapid 
elimination from the body, such as time release formulations or coatings. 

25 Kits containing the compositions and/or the combinations with 

instructions for administration thereof are provided. The kit may further 
include a needle or syringe, preferably packaged in sterile form, for 
injecting the complex, and/or a packaged alcohol pad. Instructions are 
optionally included for administration of the active agent by a clinician or 

30 by the patient. 



5 



The pharmaceutical compositions may also be administered by 
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Finally, the compounds or MTSP proteins or protease domains 
thereof or compositions containing any of the preceding agents may be 
packaged as articles of manufacture containing packaging material, a 
compound or suitable derivative thereof provided herein, which is 
5 effective for treatment of a diseases or disorders contemplated herein, 
within the packaging material, and a label that indicates that the 
compound or a suitable derivative thereof is for treating the diseases or 
disorders contemplated herein. The label can optionally include the 
disorders for which the therapy is warranted. 

10 L, METHODS OF TREATMENT 

The compounds identified by the methods herein are used for 
treating or preventing neoplastic diseases in an animal, particularly a 
mammal, including a human, is provided herein. In one embodiment, the 
method includes administering to a mammal an effective amount of an 

15 inhibitor of an MTSP protein, whereby the disease or disorder is treated or 
prevented. In a preferred embodiment, the MTSP protein inhibitor used in 
the treatment or prevention is administered with a pharmaceutically 
acceptable carrier or excipient. The mammal treated can be a human. 
The inhibitors provided herein are those identified by the screening 

20 assays. In addition, antibodies and antisense nucleic acids are 
contemplated. 

The treatment or prevention method can further include 
administering an anti-angiogenic treatment or agent or anti-tumor agent 
simultaneously with, prior to or subsequent to the MTSP protein inhibitor, 

25 which can be any compound identified that inhibits the activity of an 
MTSP protein, and includes an antibody or a fragment or derivative 
thereof containing the binding region thereof against the MTSP protein, an 
antisense nucleic acid encoding the MTSP protein, and a nucleic acid 
containing at least a portion of a gene encoding the MTSP protein into 

30 which a heterologous nucleotide sequence has been inserted such that 
the heterologous sequence inactivates the biological activity of at least a 
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portion of the gene encoding the MTSP protein, in which the portion of 
the gene encoding the MTSP protein flanks the heterologous sequence so 
as to promote homologous recombination with a genomic gene encoding 
the MTSP protein. 
5 1 . Antisense treatment 

In a specific embodiment, as described hereinabove, MTSP protein 
function is reduced or inhibited by MTSP protein antisense nucleic acids, 
to treat or prevent neoplastic disease. The therapeutic or prophylactic 
use of nucleic acids of at least six nucleotides that are antisense to a 

10 gene or cDNA encoding MTSP protein or a portion thereof. An MTSP 
protein "antisense" nucleic acid as used herein refers to a nucleic acid 
capable of hybridizing to a portion of an MTSP protein RNA (preferably 
mRNA) by virtue of some sequence complementarily. The antisense 
nucleic acid may be complementary to a coding and/or noncoding region 

15 of an MTSP protein mRNA. Such antisense nucleic acids have utility as 
therapeutics that reduce or inhibit MTSP protein function, and can be 
used in the treatment or prevention of disorders as described supra. 

The MTSP protein antisense nucleic acids are of at least six 
nucleotides and are preferably oligonucleotides (ranging from 6 to about 

20 150 nucleotides, or more preferably 6 to 50 nucleotides). In specific 
aspects, the oligonucleotide is at least 10 nucleotides, at least 15 
nucleotides, at least 100 nucleotides, or at least 125 nucleotides. The 
oligonucleotides can be DNA or RNA or chimeric mixtures or derivatives 
or modified versions thereof, single-stranded or double-stranded. The 

25 oligonucleotide can be modified at the base moiety, sugar moiety, or 
phosphate backbone. The oligonucleotide may include other appending 
groups such as peptides, or agents facilitating transport across the cell 
membrane (see, e.g., Letsinger et al., Proc. NatL Acad. Sci. U.S.A. 
86:6553-6556 (1989); Lemaitre et al., Proc. Natl. Acad. Sci. U.S.A. 

30 84:648-652 (1987); PCT Publication No. WO 88/09810, published 

December 15, 1988) or blood-brain barrier (see, e.g., PCT Publication No. 
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WO 89/10134, published April 25, 1988), hybridization-triggered 
cleavage agents (see, e.g., Krol et al,, BioTechniques 6:958-976 (1988)) 
or intercalating agents (see, e.g,, Zon, Pharm, Res. 5:539-549 (1988)). 

The MTSP protein antisense nucleic acid is preferably an 
oligonucleotide, more preferably of single-stranded DNA. In a preferred 
aspect, the oligonucleotide includes a sequence antisense to a portion of 
human MTSP protein. The oligonucleotide may be modified at any 
position on its structure with substituents generally known in the art. 

The MTSP protein antisense oligonucleotide may comprise at least 
one modified base moiety which is selected from the group including, but 
not limited to 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, 
hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) 
uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethyl- 
aminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethyl- 
guanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methyl- 
cytosine, N6-adenine, 7-methylguanine, 5-methylaminomethyluracil, 
5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 
5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6-isopen- 
tenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 
5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid 
(v), 5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, 
(acp3)w, and 2,6-diaminopurine. 

In another embodiment, the oligonucleotide includes at least one 
modified sugar moiety selected from the group including but not limited to 
arabinose, 2-fluoroarabinose, xylulose, and hexose. The oligonucleotide 
can include at least one modified phosphate backbone selected from a 
phosphorothioate, a phosphorodithioate, a phosphoramidothioate, a 
phosphoramidate, a phosphordiamidate, a methylphosphonate, an alkyi 
phosphotriester, and a formacetal or analog thereof. 
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The oligonucleotide can be an or-anomeric oligonucleotide. An a- 
anomeric oligonucleotide fornns specific double-stranded hybrids with 
complementary RNA in which the strands run parallel to each other 
(Gautier et al., Nucl. Acids Res. 15:6625-6641 (1987)). 
5 The oligonucleotide may be conjugated to another molecule, e.g., a 

peptide, hybridization triggered cross-linking agent, transport agent and 
hybridization-triggered cleavage agent. 

The oligonucleotides may be synthesized by standard methods 
known in the art, e,g, by use of an automated DNA synthesizer (such as 
10 are commercially available from Biosearch, Applied Biosystems, etc.). As 
examples, phosphorothioate oligonucleotides may be synthesized by the 
method of Stein et al. (NucL Acids Res. 16:3209 (1988)), 
methylphosphonate oligonucleotides can be prepared by use of controlled 
pore glass polymer supports (Sarin et al., Proc. Natl. Acad. Sci. U.S.A. 
15 85:7448-7451 (1988)), etc. 

In a specific embodiment, the MTSP protein antisense 
oligonucleotide includes catalytic RNA, or a ribozyme (see, e.g., PCT 
International Publication WO 90/11364, published October 4, 1990; 
Sarver et al.. Science 247 :1222-1225 (1990)). In another embodiment, 
20 the oligonucleotide is a 2'-0-methylribonucleotide (Inoue et al., Nucl. 
Acids Res. 15:6131-6148 (1987)), or a chimeric RNA-DNA analogue 
(Inoue et al., FEBS Lett. 215:327-330 (1987)). 

In an alternative embodiment, the MTSP protein antisense nucleic 
acid is produced intracellularly by transcription from an exogenous 
25 sequence. For example, a vector can be introduced in vivo such that it is 
taken up by a ceil, within which cell the vector or a portion thereof is 
transcribed, producing an antisense nucleic acid (RNA). Such a vector 
would contain a sequence encoding the MTSP protein antisense nucleic 
acid. Such a vector can remain episomal or become chromosomally 
30 integrated, as long as it can be transcribed to produce the desired 

antisense RNA. Such vectors can be constructed by recombinant DNA 
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technology methods standard in the art. Vectors can be plasmid, viral, or 
others known in the art, used for replication and expression in mammalian 
cells. Expression of the sequence encoding the MTSP protein antisense 
RNA can be by any promoter known in the art to act in mammalian, 
5 preferably human, cells. Such promoters can be inducible or constitutive. 
Such promoters include but are not limited to: the SV40 early promoter 
region (Bernoist and Chambon, Nature 290 :304-310 (1981), the promoter 
contained in the 3' long terminal repeat of Rous sarcoma virus 
(Yamamoto et al., Ce// 22:787-797 (1980), the herpes thymidine kinase 
10 promoter (Wagner et al., Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445 

(1981), the regulatory sequences of the metallothionein gene (Brinster et 
al.. Nature 296 :39-42 (1982), etc. 

The antisense nucleic acids include sequence complementary to at 
least a portion of an RNA transcript of an MTSP protein gene, preferably a 
15 human MTSP protein gene. Absolute complementarity, although 
preferred, is not required. 

The amount of MTSP protein antisense nucleic acid that will be 
effective in the treatment or prevention of neoplastic disease will depend 
on the nature of the disease, and can be determined empirically by 
20 standard clinical techniques. Where possible, it is desirable to determine 
the antisense cytotoxicity in cells in vitro, and then in useful animal model 
systems prior to testing and use in humans. 

2, Gene Therapy 

In an exemplary embodiment, nucleic acids that include a sequence 
25 of nucleotides encoding an MTSP protein or functional domains or 

derivative thereof, are administered to promote MTSP protein function, by 
way of gene therapy. Gene therapy refers to therapy performed by the 
administration of a nucleic acid to a subject. In this embodiment, the 
nucleic acid produces its encoded protein that mediates a therapeutic 
30 effect by promoting MTSP protein function. Any of the methods for gene 
therapy available in the art can be used (see, Goldspiel et al.. Clinical 
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Pharmacy 12:4-88-505 (1993); Wu and Wu, Bio therapy 3:87-95 (1991); 
Tolstoshev, An. Rev. Pharmacol. Toxicol. 32:573-596 (1993); Mulligan, 
Science 260 :926-932 (1993); and Morgan and Anderson, An. Rev. 
Biochem. 62:191-217 (1993); TIBTECH 11(5) :155-215 (1993). For 
5 example, one therapeutic composition for gene therapy includes an MTSP 
protein-encoding nucleic acid that is part of an expression vector that 
expresses an MTSP protein or domain, fragment or chimeric protein 
thereof in a suitable host. In particular, such a nucleic acid has a 
promoter operably linked to the MTSP protein coding region, the promoter 

10 being inducible or constitutive, and, optionally, tissue-specific. In another 
particular embodiment, a nucleic acid molecule is used in which the MTSP 
protein coding sequences and any other desired sequences are flanked by 
regions that promote homologous recombination at a desired site in the 
genome, thus providing for intrachromosomal expression of the MTSP 

15 protein nucleic acid (Koller and Smithies, Proc. Natl. Acad. Sci. USA 
86:8932-8935 (1989); Zijistra et al.. Nature 342:435-438 (1989)). 

Delivery of the nucleic acid into a patient may be either direct, in 
which case the patient is directly exposed to the nucleic acid or nucleic 
acid-carrying vector, or indirect, in which case, cells are first transformed 

20 with the nucleic acid in vitro, then transplanted into the patient. These 
two approaches are known, respectively, as in vivo or ex vivo gene 
therapy. 

In a specific embodiment, the nucleic acid is directly administered 
in vivo, where it is expressed to produce the encoded product. This can 

25 be accomplished by any of numerous methods known in the art, e.g., by 
constructing it as part of an appropriate nucleic acid expression vector 
and administering it so that it becomes intracellular, e.g., by infection 
using a defective or attenuated retroviral or other viral vector (see U.S. 
Patent No. 4,980,286), or by direct injection of naked DNA, or by use of 

30 microparticle bombardment (e.g., a gene gun; Biolistic, Dupont), or 
coating with lipids or cell-surface receptors or transfecting agents. 
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encapsulation in liposomes, nnicroparticles, or microcapsules, or by 
administering it in linkage to a peptide which is known to enter the 
nucleus, by administering it in linkage to a ligand subject to receptor- 
mediated endocytosis (see e.g., Wu and Wu, J. Biol, Chem. 262 :4429- 
5 4432 (1987)) (which can be used to target cell types specifically 

expressing the receptors), etc. In another embodiment, a nucleic acid- 
ligand complex can be formed in which the ligand is a fusogenic viral 
peptide to disrupt endosomes, allowing the nucleic acid to avoid 
lysosomal degradation. In yet another embodiment, the nucleic acid can 

10 be targeted in vivo for cell specific uptake and expression, by targeting a 
specific receptor (see, e.g., PCT Publications WO 92/06180 dated April 
16, 1992 (Wu et aL); WO 92/22635 dated December 23, 1992 (Wilson 
et al.); WO92/20316 dated November 26, 1992 (Findeis et al.); 
W093/14188 dated July 22, 1993 (Clarke et al.), WO 93/20221 dated 

15 October 14, 1993 (Young)). Alternatively, the nucleic acid can be 
introduced intracellularly and incorporated within host cell DNA for 
expression, by homologous recombination (Koller and Smithies, Proc. 
Natl. Acad. Sci. USA 86:8932-8935 (1989); Zijistra et al.. Nature 
342:435-438 (1989)). 

20 In a specific embodiment, a viral vector that contains the MTSP 

protein nucleic acid is used. For example, a retroviral vector can be used 
(see Miller et al., Meth. EnzymoL 217 :581-599 (1993)). These retroviral 
vectors have been modified to delete retroviral sequences that are not 
necessary for packaging of the viral genome and integration into host cell 

25 DNA. The MTSP protein nucleic acid to be used in gene therapy is cloned 
into the vector, which facilitates delivery of the gene into a patient. More 
detail about retroviral vectors can be found in Boesen et al., Biotherapy 
6:291-302 (1994), which describes the use of a retroviral vector to 
deliver the mdrl gene to hematopoietic stem cells in order to make the 

30 stem cells more resistant to chemotherapy. Other references illustrating 
the use of retroviral vectors in gene therapy are: Clowes et al., J. Clin. 
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Invest, 93:644-651 (1994); Kiem et al., fi/ooc/ 83: 1 467-1 473 (1994); 
Salmons and Gunzberg, Human Gene Therapy 4\^29-^4^^ (1993); and 
Grossman and Wilson, Curr. Opin. in Genetics and Devel, 3:1 10-1 14 

(1993) . 

5 Adenoviruses are other viral vectors that can be used in gene 

therapy. Adenoviruses are especially attractive vehicles for delivering 
genes to respiratory epithelia. Adenoviruses naturally infect respiratory 
epithelia where they cause a mild disease. Other targets for adenovirus- 
based delivery systems are liver, the central nervous system, endothelial 
10 cells, and muscle. Adenoviruses have the advantage of being capable of 
infecting non-dividing cells. Kozarsky and Wilson, Current Opinion in 
Genetics and Development 3'AQQ-b03 (1993) present a review of 
adenovirus-based gene therapy. Bout et al., Human Gene Therapy AO 

(1994) demonstrated the use of adenovirus vectors to transfer genes to 
15 the respiratory epithelia of rhesus monkeys. Other instances of the use of 

adenoviruses in gene therapy can be found in Rosenfeld et al.. Science 
252:431-434 (1991); Rosenfeld et al., Ce// 68:1 43-1 55 (1992); and 
Mastrangeli et aL, J, Clin. Invest. 91:225-234 (1993). 

Adeno-associated virus (AAV) has also been proposed for use in 
20 gene therapy (Walsh et al., Proc. Soc. Exp, Biol. Med. 204 :289-300 
(1993), 

Another approach to gene therapy involves transferring a gene to 
cells in tissue culture by such methods as electroporation, lipofection, 
calcium phosphate mediated transfection, or viral infection. Usually, the 

25 method of transfer includes the transfer of a selectable marker to the 

cells. The cells are then placed under selection to isolate those cells that 
have taken up and are expressing the transferred gene. Those cells are 
then delivered to a patient. 

In this embodiment, the nucleic acid is introduced into a cell prior 

30 to administration in vivo of the resulting recombinant cell. Such 

introduction can be carried out by any method known in the art, including 
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but not limited to transfection, electroporation, microinjection, infection 
with a viral or bacteriophage vector containing the nucleic acid 
sequences, cell fusion, chromosome-mediated gene transfer, microcell- 
mediated gene transfer, spheroplast fusion, etc. Numerous techniques 
5 are known in the art for the introduction of foreign genes into cells (see 
e.g,, Loeffler and Behr, Meth. EnzymoL 217 :599-618 (1993); Cohen et 
al., Meth, EnzymoL 217 :618-644 (1993); Cline, Pharmac. Then 29:69-92 
(1985)) and may be used, provided that the necessary developmental and 
physiological functions of the recipient cells are not disrupted. The 

10 technique should provide for the stable transfer of the nucleic acid to the 
cell, so that the nucleic acid is expressible by the cell and preferably 
heritable and expressible by its cell progeny. 

The resulting recombinant cells can be delivered to a patient by 
various methods known in the art. In a preferred embodiment, epithelial 

15 cells are injected, e.g., subcutaneously. In another embodiment, 

recombinant skin cells may be applied as a skin graft onto the patient. 
Recombinant blood cells (e.g., hematopoietic stem or progenitor cells) are 
preferably administered intravenously. The amount of cells envisioned for 
use depends on the desired effect, patient state, etc., and can be 

20 determined by one skilled in the art. 

Cells into which a nucleic acid can be introduced for purposes of 
gene therapy encompass any desired, available cell type, and include but 
are not limited to epithelial cells, endothelial cells, keratinocytes, 
fibroblasts, muscle cells, hepatocytes; blood cells such as T lymphocytes, 

25 B lymphocytes, monocytes, macrophages, neutrophils, eosinophils, 
megakaryocytes, granulocytes; various stem or progenitor cells, in 
particular hematopoietic stem or progenitor cells, e.g., as obtained from 
bone marrow, umbilical cord blood, peripheral blood, fetal liver, etc. 
In a preferred embodiment, the cell used for gene therapy is 

30 autologous to the patient. In an embodiment in which recombinant cells 
are used in gene therapy, an MTSP protein nucleic acid is introduced into 
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the cells such that it is expressible by the cells or their progeny, and the 
recombinant cells are then administered in vivo for therapeutic effect. In 
a specific embodiment, stem or progenitor cells are used. Any stem 
and/or progenitor cells which can be isolated and maintained in vitro can 
5 potentially be used in accordance with this embodiment. Such stem cells 
include but are not limited to hematopoietic stem cells (HSC), stem cells 
of epithelial tissues such as the skin and the lining of the gut, embryonic 
heart muscle cells, liver stem cells (PCT Publication WO 94/08598, dated 
April 28, 1994), and neural stem cells (Stemple and Anderson, Cell 

10 21:973-985 (1992)). 

Epithelial stem cells (ESCs) or keratinocytes can be obtained from 
tissues such as the skin and the lining of the gut by known procedures 
(Rheinwald, Meth. Cell Bio. 21 A :229 (1980)). In stratified epithelial 
tissue such as the skin, renewal occurs by mitosis of stem cells within the 

15 germinal layer, the layer closest to the basal lamina. Stem cells within 
the lining of the gut provide for a rapid renewal rate of this tissue. ESCs 
or keratinocytes obtained from the skin or lining of the gut of a patient or 
donor can be grown in tissue culture (Rheinwald, Meth. Cell Bio. 21 A :229 
(1980); Pittelkow and Scott, Mayo Clinic Proc. 61:771 (1986)). If the 

20 ESCs are provided by a donor, a method for suppression of host versus 
graft reactivity {e.g., irradiation, drug or antibody administration to 
promote moderate immunosuppression) can also be used. 

With respect to hematopoietic stem cells (HSC), any technique 
which provides for the isolation, propagation, and maintenance in vitro of 

25 HSC can be used in this embodiment. Techniques by which this may be 
accomplished include (a) the isolation and establishment of HSC cultures 
from bone marrow cells isolated from the future host, or a donor, or 
(b) the use of previously established long-term HSC cultures, which may 
be allogeneic or xenogeneic. Non-autologous HSC are used preferably in 

30 conjunction with a method of suppressing transplantation Immune 

reactions of the future host/patient. In a particular embodiment, human 
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bone marrow cells can be obtained from the posterior iliac crest by needle 
aspiration (see, e.g,, Kodo et al., J. Clin. Invest. 73:1377-1384 (1984)). 
In a preferred embodiment, the HSCs can be made highly enriched or in 
substantially pure form. This enrichment can be accomplished before, 
5 during, or after long-term culturing, and can be done by any techniques 
known in the art. Long-term cultures of bone marrow cells can be 
established and maintained by using, for example, modified Dexter cell 
culture techniques {Dexter et al., J, Cell PhysioL 91:335 (1977) or 
Witlock-Witte culture techniques (Witlock and Witte, Proc. Natl. Acad. 

10 Sci. USA 79:3608-3612 (1982)). 

In a specific embodiment, the nucleic acid to be introduced for 
purposes of gene therapy includes an inducible promoter operably linked 
to the coding region, such that expression of the nucleic acid is 
controllable by controlling the presence or absence of the appropriate 

15 inducer of transcription. 

3. Prodrugs 

A method for treating tumors is provided. The method is practiced 
by administering a prodrug that is specifically cleaved by an MTSP to 
release an active drug. Upon contact with a cell that expresses MTSP 

20 activity, the prodrug is converted into an active drug. The prodrug can be 
a conjugate that contains the active agent, such as an anti-tumor drug, 
such as a cytotoxic agent, or other atherapeutic agent, linked, linked to a 
substrate for the targeted MTSP, such that the drug or agent is inactive 
or unable to enter a cell, in the conjugate, but is activated upon cleavage. 

25 The prodrug, for example, can contain an oligopeptide, preferably a 

relatively short, less than about 10 amino acids peptide, that is selectively 
proteolytically cleaved by the targeted MTSP. Cytotoxic agents, include, 
but are not limited to, alkylating agents, antiproliferative agents and 
tubulin binding agents. Others include, vinca drugs, mitomycins, 

30 bleomycins and taxanes. 



M. 



ANIMAL MODELS 
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Transgenic animal models are provided herein. Such an animal can 
be initially produced by promoting homologous recombination between an 
MTSP protein gene in its chromosome and an exogenous MTSP protein 
gene that has been rendered biologically inactive (preferably by insertion 
5 of a heterologous sequence, e.g., an antibiotic resistance gene). In a 
preferred aspect, this homologous recombination is carried out by 
transforming embryo-derived stem (ES) cells with a vector containing the 
insertionally inactivated MTSP protein gene, such that homologous 
recombination occurs, followed by injecting the ES cells into a blastocyst, 

10 and implanting the blastocyst into a foster mother, followed by the birth 
of the chimeric animal ("knockout animal") in which an MTSP protein 
gene has been inactivated (see Capecchi, Science 244 : 1 288-1 292 
(1989)). The chimeric animal can be bred to produce additional knockout 
animals. Such animals can be mice, hamsters, sheep, pigs, cattle, etc, 

15 and are preferably non-human mammals. In a specific embodiment, a 
knockout mouse is produced. 

Such knockout animals are expected to develop or be predisposed 
to developing neoplastic diseases and thus can have use as animal 
models of such diseases e.g., to screen for or test molecules for the 

20 ability to treat or prevent such diseases or disorders. Hence, the animal 
models for are provided. Such an animal can be initially produced by 
promoting homologous recombination between an MTSP gene in its 
chromosome and an exogenous MTSP protein gene that would be over- 
expressed or mis-expressed (preferably by expression under a strong 

25 promoter). In a preferred aspect, this homologous recombination is 

carried out by transforming embryo-derived stem (ES) cells with a vector 
containing the over-expressed or mis-expressed MTSP protein gene, such 
that homologous recombination occurs, followed by injecting the ES cells 
into a blastocyst, and implanting the blastocyst into a foster mother, 

30 followed by the birth of the chimeric animal in which an MTSP gene has 
been over-expressed or mis-expressed (see Capecchi, Science 
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244 :1288-1292 (1989)). The chimeric animal can be bred to produce 
additional animals with over-expressed or mis-expressed MTSP protein. 
Such animals can be mice, hamsters, sheep, pigs, cattle, etc., and are 
preferably non-human mammals. In a specific embodiment, a mouse with 
5 over-expressed or mis-expressed MTSP protein is produced. 

The following examples are included for illustrative purposes only 
and are not intended to limit the scope of the invention. 



Cloning of MTSP3, cloning and mutagenesis of the Protease domain of 
MTSP3 

1 - Identification and cloning of MTSP3 



DNA encoding the protease domain of the protease designated 
MTSPI was independently cloned from the human prostatic 
adenocarcinoma cell line, PC-3, using degenerate oligonucleotide primers, 

20 then sequenced and characterized (see EXAMPLE 6). The sequence of 
the sense degenerate primer used in cloning MTSPI was 
5'-TGGRT(l)VT{l)WS(l)GC(l)RC(l)CAYTG-3' (SEQ ID No: 13), and that of 
the anti-sense was 5'-(l)GG{l)CC(l)CC{l)SWRTC(l)CCYT(l)RCA(l)GHRTC-3' 
(SEQ ID No:14), where R-A,G; V = G,A,C; W = A,T; S = G,C; Y = C,T; 

25 H=A,T,C. The primer sequences correspond to two highly conserved 
regions in all serine proteases and should amplify PGR products ranging 
from 400 to 500 base pairs. MTSPI was subsequently found to be 
identical to matriptase (Genbank accession number AF1 18224; see also 
Takeuchi et al., Proc. Natl. Acad, ScL USA, 96(20) :1 1054-61 (1999); 

30 and Lin et al., J. BioL Chem., 274(26) :18231-6 1999). 

Using the protein sequence of the protease domain of the serine 
protease MTSPI, the EST database (dbEST) at the National Center for 



EXAMPLE 1 



15 



a. 



Identification of EST clones AI924527 and AI924182 
as part of a serine protease MTSP3 
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Biotechnology Information (Bethesda, MD; www.ncbi.nlm.nih.gov) was 

searched for EST clones that contain similar or identical sequences to 

MTSP1 using the search algorithm tbiastn. The tbiastn algorithm 

compares a protein query sequence against a nucleotide sequence 

5 database dynamically translated in all six reading frames (both strands). 

The sequences for two identical EST clones (NCI CGAP Lul 9 AI924527 

and AI924182) derived from human lung tumor tissue showed 43% 

identity with the MTSP1 protein sequence. Subsequent search of 

GenBank and SwissProt database for the EST sequence AI924527 and 

10 AI924182 did not show any matching sequence to MTSP1, indicating 

that the sequence contained in these EST clones AI924527 and 

AI924182 may be portions of a new serine protease. 

b. PCR cloning of a cDNA fragment of another membrane 
type serine protease MTSPS 

15 

The double-stranded Marathon-Ready(tm) cDNA library derived 
from human lung carcinoma (LX-I) was obtained from Clontech (Palo 
Alto, CA; catalog # 7495-1) and used as a template. Two primers, 
5'-TCACCGAGAAGATGATGTGTGCAGGCATCC-3' (SEQ ID No: 15) 

20 (sense primer), and 5'~GGGACAGGGGCTGTAAGGCAGGGAATGAG-3' 

(SEQ ID No: 16) (antisense primer), were used to amplify a —360 bp DNA 
fragment. The PCR product was separated on a 2% agarose gel and 
purified using a gel extraction kit (catalog number 28706; QIAquick gel 
extraction kit; Qiagen). The purified DNA fragment was ligated into TA 

25 vectors (catalog number K4500-01; TOPO-TA cloning kit, Invitrogen, 
Carlsbad, CA). After transformation into E. coli cells, plasmids were 
isolated and analyzed by digestion with EcoRI restriction enzyme. Clones 
that had inserted DNA were further characterized by sequencing using a 
fluorescent dye-based DNA sequencing method (catalog number 

30 4303149; BigDye terminator cycle sequencing kit with AmpliTaq DNA 
polymerase; Perkin Elmer, Lincoln, CA). 
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The DNA sequence obtained was analyzed and has 43% identity 
with the MTSP1 protein sequence. This indicates that the LX-1 cDNA 
library contains a desired nucleic acid molecule. It was used to isolate a 
cDNA clone encompassing a full length protease. 
5 c- 5'- and 3'- rapid amplification of cDNA ends (RACE) 

To obtain the full-length cDNA that encoded this serine protease, 
hereafter called MTSPS, 5'- and 3'-RACE reactions were performed. The 
Marathon-Ready cDNA library from human lung carcinoma (LX-1) was 
used to isolate the 5' and 3' ends of the cDNA encoding MTSP3. 
10 Marathon-Ready cDNA is specifically made for RACE reactions. Two 
gene specific primers were used: 

5'-CCCGCAGCCATAGCCCCAGCTAACG-3' (SEQ ID No. 17) for 5'-RACE 

reaction and 5'-GCAGACGATGCGTACCAGGGGGAAGTC-3' (SEQ ID No. 

18) for 3'-RACE reaction. Two fragments, approximately 1.8 kbp and 

15 0.85 kbp, were isolated that correspond to the missing 5' and 3' end 

sequences, respectively. These fragments were subcloned as described 

above. They were further confirmed by Southern analysis using an 

internal cDNA fragment encompassing the 2 primers used in the RACE 

reactions as probe and by DNA sequence analysis. 

20 d. PGR amplification of cDNA encoding full-length 

protease domain of MTSP3 

To obtain the cDNA fragment encoding the protease domain of 
MTSP3, an end-to-end PCR amplification using gene-specific primers was 
25 used. The two primers used were: 

5^-CTCGAGAAAAGA GTGGTGGGTGGGGAGGAGGCCTCTGTG -3^ (SEQ ID 
No. 19) for the 5' end and 

5'-GCGGCCGCAIIACAGCTCAGCCTTCCAGAC-3' (SEQ ID No. 20) for 
the 3' end. The 5' primer contains the sequence (underlined) that 
30 encodes the start of the MTSP3 protease domain (VVGGEEASV). The 3' 
primer contains the stop codon (underlined) of MTSP3. A "-700-bp 
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fragment was amplified and subcloned into a Pichia pastoris expression 
vector, pPIC9K. 

e. C310S mutagenesis of MTSPS 

To eliminate the free cysteine (at position 310 in SEQ ID No. 4) 
5 that exists when the protease domain of the MTSP3 protein is expressed 
or the zymogen is activated, the free cysteine at position 310 (see SEQ ID 
No. 3), which is Cys122 if a chymotrypsin numbering scheme Is used, 
was replaced with a serine. The resulting vector was designated 
pPIC9K:MTSP3C122S. 

10 The gene encoding the protease domain of l\/ITSP3 was 

mutagenized by PGR SOE (PCR-based splicing by overlap extension) to 
replace the unpaired cysteine at position 310 (122 chymotrypsin 
numbering system) with a serine. Two overlapping gene fragments, each 
containing the TCT codon for serine at position 310 were PGR amplified 

15 using the following primers: for the 5' gene fragment, 

TCTCTCGAGAAAAGAGTGGTGGGTGGGTGGGGAGGAGGCCTCTGTG SEQ ID 
No. 51 and GCTCCTCATCAAAGAAGGGCAGAGAGATGGGCCTGACTGTGCC 
SEQ ID No. 52; for the 3' gene fragment, 

ATTCGCGGCCGCATTACAGCTCAGCCTTCCAGAC (SEQ ID No, 53) and 
20 GGCACAGTCAGGCCCATCTCTCTGCCCTTCTTTGATGAGGAGC (SEQ ID No, 
54). The amplified gene fragments were purified on a 1 % agarose gel, 
mixed and reamplified by PGR to produce the full length coding sequence 
for MTSP3 C122S. This sequence was then cut with restriction enzymes 
NotI and Xhol, and ligated into vector pPic9K. 
25 2. Sequence analysis 

All derived DNA and protein sequences were analyzed using 
MacVector (version 6.5; Oxford Molecular Ltd., Madison, Wl). The 
full-length cDNA encoding MTSP3 is composed of 2,137 base pairs 
containing the longest open reading frame of 1,314 base pairs which 
30 translate to a 437-amino acid protein sequence. The cDNA fragment (nt 
873-1,574) encoding the protease domain of MTSP3 is composed of 702 
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base pairs which translate to a 233-amino acid protein sequence plus the 
stop codon. The DNA sequence and the translated protein sequence of 
MTSP3 are shown in SEQ ID Nos. 3 and 4, respectively. 
3. Construction of the Expression Vectors 
5 DNA encoding l\/ITSP3 full length protein containing the C310S 

point mutation (i.e., [\/ITSP3C122S) was cloned fronn 
pPIC9K:MTSP3C122S. The prinners MTSP3: 

5' GAATTCCATATGCCGCGCTTTAAAGTGGTGGGTGGGGAGGAGGCC SEQ ID 
No. 47 (containing a Ndel restriction site) and MTSP3- 

10 3' GGGATACCCGTTACAGCTCAGCCTTCCAGAC 5' SEQ ID No. 48 

(containing a BamHl restriction site) were used to PGR amplify the human 
MTSP3C122S protease domain utilizing a plasmin recognition sequence 
(PRFK) for zymogen activation. Amplification was conducted in a total 
volume of 50 jj\ containing 10 mM KCI, 20 mM Tris-HCI (pH 8.8 at 25 

15 °C), 10 mM (NH4 )2 SO4 , 2.0 mM MgSO^ , 0.1% Triton X-100, 0.3 mM 
dNTPs, 5.0 units of vent DNA polymerase, and 100 pmol of primers. The 
reaction mixtures were heated to 95 ""C for 5 min, followed by 25-30 
cycles of 95, 60, and 75 °C for 30 s each and a final extension at 75 °C 
for 2 min. 

20 PGR products were purified using a QIAquick PGR purification kit 

(QIAGEN Inc., Chatsworth, CA). Full-length oligonucleotides were doubly 
digested with 10 units BamHl and 20 units Ndel for 2 h at 37 °C. The 
digested fragments were purified on a 1 .3% agarose gel and stained with 
ethidium bromide. The band containing the MTSP3C122S encoding DNA 

25 was excised and purified using a QIAEX II gel extraction kit. 

The MTSP3G122S encoding DNA was then cloned into the Ndel 
and BamHl sites of the pET19b vector (Novagen) using standard methods. 
This vector allows the fusion of a HlSg tag for purification by metal 
affinity chromatography (MAC). Competent XL1 Blue cells (Stratagene) 

30 were transformed with the pETI 9b-MTSP3C1 22S vector and used to 
produce plasmid stocks. Proper insertion and DNA sequence were 
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confirmed by fluorescent thermal dye DNA sequencing methods as well 
as restriction digests. 

4- Protein Expression, Purification, and Refolding 

Overexpression of the gene product was achieved in E. coli strain 
5 BL21(DE3) (Novagen, Madison Wl) containing the DNAY plasmid for rare 
codon optimization (see, e.g,, Garcia et al. (1986) Ce// 45;. 45 3-45 9). 
Cells were grown at 37 °C in (2xYT) media supplemented with 
carbanicillin and kanamycin to a final concentrations of 50 ug/ml and 34 
ug/ml, respectively. One liter cultures were inoculated with 10 mL of an 
10 overnight culture grown in the same media. Cells were allowed to grow to 
a density of 0.6 - 1 .0 ODgoo before the addition of IPTG (final 
concentration 1.0 mM). Cells were grown an additional 4 hours before 
^\ harvesting. 

%J The cell pellet was resuspended in 20 mL of lysis buffer (50 mM 

15 Na2HP04, 300 mM NaCI, pH 7.4). The cell suspension was treated with 
10-20 mg lysozyme and incubated at 37°C for 1 hour. DNasel was then 
added (1-2mg) with mixing until the solution was no longer viscous. The 
1 5 solution was then transferred to a Rosette flask and sonicated, on ice, at 

D high power for 15 min. Inclusion bodies were pelleted by centrifugation 

D 20 at 20K rpm (-48,000 g) at 4°C for 30 min. 

^ Inclusion bodies were washed by douncing 2 times in 50 mM 

Na2HP04, 300 mM NaCI, 5% LADO, pH 7.4 followed by 2 times in 50 
mM Na2HP04, 300 mM NaCI, pH 7.4. Inclusion bodies (- 500 mg) are 
solubilized in 25 mL 6 M GuHCI, 100 mM tris-HCI, 20 mM ^Me, pH 8.0. 

25 This solution was spun at 20K rpm for 30 minutes to pull down any 

particulate matter. This solution was passed through a 0.2 //M filter and 
diluted to 100 mL in solubilization buffer. 

MTSP3C122S was refolded by slowly adding the inclusion body 
mixture to 8 L of refolding buffer (100 mM tris-HCI, 150 mM NaCI, 5 mM 

30 GSH, 0.05 mM GSSG, 1 M arginine, pH 8.0) using a peristaltic pump. 
The refolding mixture was allowed to stir at 4°C for 7 days or until the 
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thiol concentration was below 1 nnM as detected by Ellman's reagent. 
The solution was filtered through a 5 //M filter, concentrated by 
ultrafiltration and the buffer exchanged into MAC equilibration buffer (50 
mM Na2HP04, 300 mM NaCl, 10 mM imidazole, pH 8.0) by crossflow 
5 filtration. The resulting solution was passed through a 0.2 ijM filter and 
further purified on a FPLC (Amersham-Pharmacia) using Pharmacia 
chelating sepharose. The solution was loaded onto the nickel loaded 
MAC at a flow rate of 1 .0 mL/min and eluted with a linear gradient of 1 .0 
mM -1.0 M imidazole in 50 mM NasHPO^, 300 mM NaCI, pH 8.0. Protein 
10 containing fractions were determined by SDS-PAGE and subsequently 
pooled and frozen at -80*^C. 

Small amounts of purified MTSP3C122S were activated using 
plasmin sepharose for 30 min. at 37°C. The resin was spun down at 
14K rpm for 5 min. and the protein solution removed. The resulting 
15 solution was screened for activity against of series of protease 
substrates; spec-tpa, spec-pl, spec-UK, spec-fXlla (American 
Diagnostica), S-2238, S-2266 (Kabi Diagnostica), S-2586, S-2366, S- 
2444, S-2288, S-2251, S-2302, S-2765, S-2222, spec-THE 
(Chromogenix), spec-fVlla (Pentapharm). MTSP3C122S cleaved several 
20 of these substrates efficiently but was most active towards Spec-fXIla, 
Spec-tPA, S-2765, Spec-fVlla and S-2444. 

5, Gene expression profile of the serine protease MTSP3 in normal 
and tumor tissues 

25 To obtain information regarding the tissue distribution of the 

MTSP3 transcripts, the DNA insert encoding the MTSP3 protease domain 
was used to probe a RNA blot composed of 76 different human tissues 
(catalog number 7775-1; human multiple tissue expression (MTE) array; 
CLONTECH, Palo Alto, CA). The expression pattern observed in 

30 decreasing signal level was: trachea = colon (descending) = esophagus 
> colon (ascending) > colon (transverse) = rectum > ileum > 
duodenum > jejunum > bladder > ilocecum > stomach > kidney > 
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appendix. It is also expressed less abundantly in fetal kidney, and in two 
tumor cell lines, HeLa S3 and leukemia, K-562. Northern analysis using 
RNA blots (catalog numbers 7780-1, 7765-1 & 7782-1; human 1 2-lane, 
human muscle and human digestive system multiple tissue northern 
5 (MTN) blots; CLONTECH) confirmed that the expression was detected 
most abundantly in the colon, moderately in the esophagus, small 
intestine, bladder and kidney, and less abundantly in stomach and rectum. 
A single transcript of -2.2 kb was detected. 

Amplification of the MTSP3 transcript in several human primary 
10 tumors xenografted in mouse was performed using gene-specific primers. 
The MTSP3 transcript was detected in lung carcinoma (LX-1), colon 
adenocarcinoma (CX-1), colon adenocarcinoma (GI-112) and ovarian 
carcinoma (GI-102). No apparent signal was detected in another form of 
lung carcinoma {Gl-117), breast carcinoma (GI-101), pancreatic 
15 adenocarcinoma {Gl-103) and prostatic adenocarcinoma {PC3). 

EXAMPLE 2 
Identification of genomic clone of MTSP4 

Using the nucleotide sequence encoding the protease domain of the 
serine protease MTSP1 (also called matriptase), the protein database 
20 (SWISSPROT) at the National Center for Biotechnology Information 
(Bethesda, MD; <http://www.ncbi.nlm.nih.gov>) was searched for 
similar or identical sequence to MTSP1 using the search algorithm blastx. 
The blastx algorithm compares the six-frame conceptual translation 
products of a nucleotide query sequence (both strands) against a protein 
25 sequence database. A protein encoding sequence (CAA18442) that has 
37% identity to the MTSP1 protein sequence that was found to include a 
putative LDL-receptor domain and a trypsin-like serine protease domain 
was identified. This protein-encoding sequence (hereinafter referred to as 
MTSP4) was found to be encoded by a genomic clone (AL022314) 
30 derived from human chromosome 22 sequenced by the Sanger Centre 
Chromosome 22 Mapping Group and deposited into the public database 



-167- 





24745-1607 



as part of the Human Genome Project. Subsequent search of the 

GenBank database showed that no identical sequence has been 

deposited. A search of the EST database also did not show any matching 

human sequence, indicating that no human EST clone exists in the public 

5 database. Mouse EST clones (AI391417 and AA208793) are present and 

showed 88% identity to the serine protease at the nucleotide level. 

PGR cloning of a genomic DNA fragment of MTSP4 for use as 
hybridization probe 

In order to obtain tissue distribution profile of MTSP4 as well as to 

10 identify a tissue source for subsequent cloning of the cDNA, a genomic 
fragment was amplified from human genomic DNA using two 
gene-specific primers, 5'-CCTCCACGGTGCTGTGGACCGTGTTCC-3' {5' 
primer) SEQ ID No. 21 and 5'-CCTCGCGCAAGGCGCCCCAGCCCG-3' (3' 
primer) SEQ ID No. 22. These two primers amplified a 265-base pair 

15 fragment within a single exon of MTSP4. The fragment was then used as 
a hybridization probe on human tissue northern blot (human 12-lane 
multiple tissue northern (MTN) blot (catalog number 7780-1); CLONTECH, 
Palo Alto, CA). A prominent band ( — 2,6 kb) was detected in liver. 
Relatively weaker signals were obtained from the brain, heart, skeletal 

20 muscle and kidney. Since human liver showed a very strong signal, this 
tissue was selected for the amplification of the MTSP4 cDNA. 
5'- and 3'- rapid amplification of cDIMA ends (RACE) 

To obtain a full-length clone encoding MTSP4, 5'- and 3'-RACE 
reactions were performed. The Marathon-Ready cDNA library from 

25 human liver (CLONTECH) was used to isolate the 5' and 3' ends of the 
cDNA encoding MTSP4. Marathon-Ready cDNA clones are specifically 
made for RACE reactions. Two gene specific primers were used: 
5'-GCGTGGCGTCACCTGGTAGCGATAGACCTCGC -3' (SEQ ID No, 23) 
for 5'-RACE reaction and 5'-CCTCCACGGTGCTGTGGACCGTGTTCC-3' 

30 (SEQ ID No. 24) for 3'-RACE reaction. No fragment was obtained from 
the initial 5'-RACE reaction. 
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The 3'-RACE reaction, however, produced a —1.5 kbp fragment. 
A nested PCR reaction was used on the initial 5'-RACE reaction products 
to obtain part of the 5' end of MTSP4. The nested 5' gene-specific 
primer used was 5'-CCTCGCGCAAGGCGCCCCAGCCCG-3' (SEQ ID No. 
5 25) and produced a -^0.8 kbp fragment. The fragments were subcloned 
into pCR2.1-TOPO TA cloning vector (Invitrogen, Carlsbad, CA). The 
resulting clones were analyzed by Southern analysis using the internal 
genomic fragment encompassing the primers used in the RACE reactions 
as probe and by DNA sequence analysis. Sequence analysis of the 
10 5'-RACE product showed that the potential initiation codon was still 
missing. 

To obtain the 5' cDNA end that encodes the N terminus of MTSP4, 
the publicly available genomic sequence of chromosome 22 was searched 
for sequence corresponding to the sequence obtained in the 5'-RACE 
15 clone. The resulting genomic sequence was translated and the protein 
sequence was compared to that derived from the translated sequence of 
the 5'-RACE clone. After determining the overlapping sequences, a 
gene-specific oligonucleotide primer 

{5'-TCATCGGCCAGAGGGTGATCAGTGAG-3') SEQ ID No. 26 
20 corresponding to the sequence upstream of the potential initiation codon 
and another gene-specific oligonucleotide primer 
{5'-CCTCCTCAGTGCATAGGCATCAAACCAG-3') SEQ ID No. 27 
corresponding to a sequence within the overlapping region were used to 
amplify the missing 5' cDNA of MTSP4 from the human liver cDNA 
25 library. 

Splice variants and domain organization of MTSP4 

At least two cDNA fragments were consistently obtained during 
PCR amplification, indicating multiple splice variants of MTSP4. 
Subcloning and sequence analysis revealed that a longer, more abundant 
30 form, MTSP4-L and a shorter form, MTSP4-S. The encoded proteins are 
multi-domain, type II membrane-type serine proteases and include a 
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transmembrane domain at the N terminus followed by a CUB domain, 3 

LDLR domains and a trypsin-like serine protease domain at the C 

terminus. The difference between these two forms of MTSP4 is the 

absence in MTSP4-S of a 432-bp nucleotide sequence between the 

5 transmembrane and the CUB domains (see FIGURE 2). 

PGR amplification of cDNA encoding full-length protease domain of 
MTSP4 

To obtain a cDNA fragment encoding the protease domain of 
MTSP4, an end-to-end PCR amplification using gene-specific primers and 
10 the Marathon-Ready cDNA library from human liver was used. The two 
primers used were: 

5'-TCTCTCGAGAAAAGAATTGTTGGTGGAGCTGTGTCCTCCGAG -3' 
(SEQ ID No. 28 ) for the 5' end and 

5'-AGGTGGGCCTTGCTTTGCAGGGGGGCAGTTC-3' for the 3' end SEQ 
15 ID NO. 29). The 5' primer contained the sequence that encodes the start 
of the MTSP4 protease domain (IVGGAVSSE). The 3' primer corresponds 
to the sequence just downstream of the stop codon. A — 740-bp 
fragment was amplified, subcloned into pCR2,1-TOPO TA cloning vector 
and sequenced. 

20 Gene expression profile of l\/ITSP4 in normal and tumor tissues 

To obtain information regarding the gene expression profile of the 
MTSP4 transcript, a DNA fragment encoding part of the LDL receptor 
domain and the protease domain was used to probe an RNA blot 
composed of 76 different human tissues (catalog number 7775-1; human 

25 multiple tissue expression (MTE) array; CLONTECH). As in the northern 
analysis of gel blot, a very strong signal was observed in the liver. 
Signals in other tissues were observed in (decreasing signal level): fetal 
liver > heart = kidney = adrenal gland = testis = fetal heart and kidney 
= skeletal muscle = bladder = placenta > brain = spinal cord = colon 

30 = stomach = spleen = lymph node = bone marrow = trachea uterus 
= pancreas = salivary gland = mammary gland = lung. MTSP4 is aJso 
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expressed less abundantly in several tumor cell lines including HeLa S3 == 
leukemia K-562 = Burkitt's lymphomas (Raji and Daudi) = colorectal 
adenocarcinoma (SW480) > lung carcinoma (A549) = leukemia MOLT-4 
= leukemia HL-60. PGR of the MTSP4 transcript from cDNA libraries 
5 made from several human primary tumors xenografted in nude mice 
(human tumor multiple tissue cDNA panel, catalog number K1 522-1, 
CLONTECH) was performed using MTSP4-specific primers. The MTSP4 
transcript was detected in breast carcinoma (GI-101), lung carcinoma 
(LX-1), colon adenocarcinoma (GI-1 12) and pancreatic adenocarcinoma 

10 (Gl-103). No apparent signal was detected in another form of lung 
carcinoma (GI-1 17), colon adenocarcinoma (CX-1), ovarian carcinoma 
(GI-1 02), and prostatic adenocarcinoma (PCS). The MTSP4 transcript 
was also detected in LNCaP and PC-3 prostate cancer cell lines as well as 
in HT-1080 human fibrosarcoma cell line. 

15 Sequence analysis 

MTSP4 DNA and protein sequences were analyzed using 
MacVector (version 6.5; Oxford Molecular Ltd., Madison, Wl), The ORF 
of MTSP4-L includes 2,409 bp, which translate to a 802-amino acid 
protein, while the ORF of MTSP4-S is composed of 1,977 bp which 

20 translate to a 658-amino acid protein. The cDNA encoding the protease 
domain in both forms is composed of 708 bp which translate to a 
235-amino acid protein sequence (see, SEQ ID No. 6) The DNA 
sequences and the translated protein sequences of MTSP4-L and 
MTSP4-S, and of the protease domain of MTSP4 are set forth in SEQ ID 

25 Nos. 8, 10 and 6, respectively. 

EXAMPLE 3 

Cloning of MTSP6 

Identification of genomic clone of MTSP6 

Using the protein sequence of the protease domain of the serine 
30 protease MTSP4 (see EXAMPLE 2), the non-redundant database (all 

non-redundant GenBank CDS translations + PDB + SwissProt 4- PIR + 
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PRF) at the National Center for Biotechnology Information (Bethesda, MD; 
<http://www.ncbi.nlm.nih.gov>) was searched for sequences that were 
similar or identical to MTSP4 using the search algorithm tbiastn. The 
tbiastn algorithm compares a protein query sequence against a nucleotide 
5 sequence database dynamically translated in all reading frames. A protein 
(55 amino acids), which has 60% identity with the query MTSP4 
sequence (55 amino acids), was obtained from the translation of genomic 
sequence of AC015555 (nucleotide #15553 to 15717). This protein 
hereafter is referred to as MTSP6. Subsequent search of the GenBank 

10 database showed that no cDNA encoding MTSP6 has been deposited. 

The gene exhibiting highest homology to MTSP6 was human 
transmembrane serine protease 2 (GenBank accession number U75329; 
Swissprot accession number 015393), which showed 66% identity to 
MTSP6 within the 45 amino acid regions compared. Consequently, the 

15 nucleotide sequence encoding the MTSP6 protease domain was obtained 
by comparing the protein sequence of human transmembrane serine 
protease 2 protease domain with the nucleotide sequence of AC015555 
translated in six reading frames. The protein sequence obtained from the 
translated nucleotide sequence of MTSP6 revealed an overall 50% 

20 identity with human transmembrane serine protease 2. A search of the 
EST database indicated the presence of seven MTSP6 EST clones 
(AA883068, AW591433, AI978874, AI469095, AI935487, AA534591 
and AI758271). 

Cloning of human MTSP6 full-length cDNA 

25 To obtain cDNA encoding the region of the MTSP6 protease 

domain identified by database searches described above, two 
gene-specific primers, Ch17-NSP-l, 

5'-TCACGCATCGTGGGTGGAACATGTCC-3' (5' primer) SEQ ID NO. 30 
nd Ch17-NSP-2AS, 5'- ACCCACCTCCATCTGCTCGTGGATCC-3' SEQ ID 
30 NO. 31 (3' primer), were used for PCR. These two primers amplified a 
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708-base pair fragment from human mammary gland carcinoma cDNA 
(Clontech Marathon-Ready cDNA, Cat. No. 7493-1). 

To obtain the remaining, unknown cDNA of MTSP6, 5'- and 
3'-RACE reactions were performed on the human mammary gland 
5 carcinoma. Marathon-Ready cDNA is specifically made for RACE 
reactions. The first RACE reactions were performed by PCR using 
Marathon cDNA adaptor primer 1 (API) with gene specific primers, 
Ch17-NSP-2AS, 5'- ACCCACCTCCATCTGCTCGTGGATCC-3' SEQ ID 
NO. 31 for 5'-RACE reaction and Ch17-NSP-1, 

10 5'-TCACGCATCGTGGGTGGAACATGTCC-3' SEQ ID NO. 30 for 3'-RACE 
reaction. The PCR products were purified from agarose gel. A second 
nested PCR was then performed using Marathon cDNA adaptor primer 2 
(AP2) with gene specific primer Chi 7-NSP-3AS, 
5'- CCACAGCCTCCTCTCTTGACACACCAG-3' SEQ ID No. 32 for 

15 5'-RACE reaction (using first 5'-RACE product as template) and 

Ch17-NSP-3 5'-ACGCCCCTGTGGATCATCACTGCTGC-3' SEQ ID No. 33 
for 3'-RACE reaction (using first 3'-RACE product as template). First 5'- 
and 3'-RACE products were also used as template for PCR reactions using 
primers Ch17-NSP-3 and Ch17-NSP-4AS to obtain a cDNA fragment for 

20 use as a probe. PCR products from RACE reactions which were larger 
than 700 bp were cut out and purified from agarose gel and subcloned 
into pCR2.1-T0P0 cloning vector (Invitrogen, Carlsbad, CA). Colony 
hybridization was then performed to identify positive colonies containing 
MTSP6 sequence. Positive clones were identified by colony hybridization 

25 using the 495 bp DNA fragment obtained from PCR reaction (with primers 
Ch17-NSP-3 and Chi 7-NSP-4AS) and by DNA sequencing. 

Sequence analysis of the 5'-RACE products indicated that an 
additional 420 bp of upstream sequence were obtained. The potential 
initial codon was not present in the 5'-RACE sequence. Another round 

30 of nested 5'-RACE reaction was performed using AP2 and a gene specific 
primer (designed based on the new RACE sequence) Ch17-NSP-5AS 
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5'-TCCCTCCCTCACATATACTGAGTGGTG-3' SEQ ID No. 34, using the 
PGR products obtained from the first 5'-RACE as template. A PGR 
product of 367 bp using Gh17-NSP-6 

5'-GGACTGGTGAGGGAAGTCAGATGTCG-3' SEQ ID NO. 35 (designed 
5 based on the new 5'-RACE sequence) and Ch17-NSP-5AS was used to 
identify the positive clones. An additional sequence of 480 bp was 
obtained from the second 5'-RACE products. A potential ATG start 
codon was observed within a sequence of GTGAGGATGG (nucleotides 
262-272 of SEQ ID No. 12, which appears to be a Kozak sequence (GGC 
10 (A/G) CCAUGG), indicating that this ATG is likely the initiation codon for 
MTSP6. 

The 3'-RAGE reaction to obtain the rest of the 3' end of MTSP6 
was not successful using Marathon Ready human mammary gland 
carcinoma cDNA. The sequence of the 3'-RACE products obtained was 

15 exclusively that of an MTSP6 cDNA truncated with the Marathon AP2 
primer sequence within the coding region. 

7he 3'-end sequence of MTSP6 was obtained by PGR using 
Ch17-NSP-3 (5'-ACGGGCCTGTGGATGATGACTGGTGC-3'; SEQ ID NO. 
33) and Ch17-NSP-4 (5'-GTGGTGTGTCAAGAGAGGAGGGTGTGG-3'; 

20 SEQ ID NO. 37) with an antisense primer Gh17-NSP-7AS 

{5'-AGTCAGGTGGCTAGTTATCGCGTTCCTG-3'; SEQ ID NO. 38) 
designed based on the sequence of an EST clone AA883068, which 
apparently covers the 3'-end of MTSP6 sequence, and human small 
intestine cDNA (Glontech) as template. Two PGR products (650 bp and 

25 1 82 bp, respectively) were obtained and DNA sequence analysis indicated 
that both PGR products contained a stop codon. 
Sequence analysis and domain organization of MTSP6 

The MTSP6 DNA and protein sequences were analyzed using DNA 
Stridor (version 1.2). The ORE of MTSP6 is composed of 1,362 bp, 

30 which translate into a 453-amino acid protein. Protein sequence analysis 
using the SMART (Simple Modular Architecture Research Tool) program 
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at http://smart.embl-heidelberg.de predicts that MTSPB is a multi-domain, 
type-ll membrane-type serine protease containing of a transmembrane 
domain (amino acids 48-68) at the N terminus followed by a LDLRa 
domain (LDL receptor domain class a) (amino acids 72-108), a SR domain 
(Scavenger receptor Cys-rich domain)(amino acids 109-205), and a 
trypsin-like serine protease domain (amino acids 216-443) (see 
FIGURE 3). 

Gene expression profile of (VITSPG in normal and tumor tissues 

To obtain information regarding the gene expression profile of the 
MTSP6 transcript, a 495 bp DNA fragment obtained from PGR reaction 
with primers Ch17-NSP-3 and NSP-4AS was used to probe an RNA blot 
composed of 76 different human tissues (catalog number 7775-1; human 
multiple tissue expression (MTE) array; CLONTECH). The strongest signal 
was observed in duodenum. Signal in other tissues were observed in 
(decreased signal level): Stomach > trachea = mammary gland = 
thyroid gland = salivary gland = pituitary gland = pancreas > kidney > 
lung > jejunum = ileum = ilocecum = appendix = fetal kidney > fetal 
lung. Very weak signals can also be detected in several other tissues. 
MTSP6 is also expressed in several tumor cell lines including HeLa S3 > 
colorectal adenocarcinoma (SW480) > leukemia l\/IOLT-4 > leukemia 
K-562. PGR analysis of the MTSP6 transcript from cDNA libraries made 
from several human primary tumors xenografted in nude mice (human 
tumor multiple tissue cDNA panel, catalog number K1 522-1, GLONTEGH) 
was performed using MTSP6-specific primers {Gh17-NSP-3 and 
Gh17-NSP2AS). The MTSP6 transcript was strongly detected in lung 
carcinoma (LX-1), moderately detected in pancreatic adenocarcinoma 
(GI-103), weakly detected in ovarian carcinoma (GI-102); and very weakly 
detected in colon adenocarcinoma (GI-1 12 and GX-1), breast carcinoma 
(GI-101), lung carcinoma (GI-1 17) and prostatic adenocarcinoma (PG3). 
The MTSP6 transcript was also detected in breast cancer cell line 
MDA-MB-231, prostate cancer cell line PG-3, but not in HT-1080 human 
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fibrosarcoma cell line. MTSP6 is also expressed in mammary gland 
carcinoma cDNA (Clontech). 

EXAMPLE 4 

Expression of the protease MTSP domains 

The DNA encoding each of the MTSP 3 and 4 protease domains 

was cloned into a derivative of the Pichia pastoris vector pPIC9K 

(available from Invitrogen; see SEQ ID NO. 45). Plasmid pPIC9k features 

include the 5' A0X1 promoter fragment at 1-948; 5' A0X1 primer site at 

855-875; alpha-factor secretion signal(s) at 949-1218; alpha-factor 

primer site at 1 1 52-1 1 72; multiple cloning site at 1 1 92-1 241 ; 3' A0X1 

primer site at 1327-1347; 3' A0X1 transcription termination region at 

1253-1586; HIS4 ORF at 4514-1980; kanamycin resistance gene at 

5743-4928; 3' A0X1 fragment at 6122-6879; ColEI origin at 

7961-7288; and the ampicillin resistance gene at 8966-8106. The 

plasmid used herein is derived from pPIC9K by eliminating the Xhol site in 

the kanamycin resistance gene and the resulting vector is herein 

designated pPIC9KX. 

Primers used for PCR amplification of protease domain and 
subcloning into the Xhol/NotI sites of Pichia vector 

MTSP3 

5' primer (with Xhol site [underlined]) SEQ ID No. 39 

5' TCTCICGAGAAAAGAGTGGTGGGTGGGGAGGAGGCCTCTGTG 3' 

3' primer (with NotI site [underlined]) SEQ ID No. 40 

5' ATTC GCGGCCGC ATTACAGCTCAGCCTTCCAGAC 3' 

MTSP4-S and MTSP4-L 
5' primer (with Xhol site [underlined]) SEQ ID No. 41 

5' TCTCICGAGAAAAGAATTGTTGGTGGAGCTGTGTCCTCCGAG 
3' primer with NotI site SEQ ID No, 42 

5' ATTCGCGGCCGCTCAGGTCACCACTTGCTGGATCCAG 3' 
MTSPa 
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MTSP6 was cloned into the £. co// TOPO vector (pcR® 2.1 TOPO™, 
SEQ ID No. 46, Invitrogen, Carlsbad, CA; the TOPO® TA Cloning® Kit is 
designed form cloning Taq-amplified PRCR products). 
5' primer (with Xhol site [underlined]) SEQ ID No. 43 



3' primer NotI site comes from E. coli TOPO vector SEQ ID No. 44: 
5' ACTCAGGTGGCTACTTATCCCCTTCCTC 3' 

EXAMPLE 5 

Assays for identification of candidate compounds that modulate that 
1 0 activity of an MTSP 

Assay for identifying inhibitors 

The ability of test compounds to act as inhibitors of catalytic 
activity of an MTSP, including MTSP1, MTSP3, MTSP4, MTSP6 can be 
assessed in an amidolytic assay. The inhibitor-induced inhibition of 

15 amidolytic activity by a recombinant MTSP or the protease domain 
portions thereof, can be measured by IC50 values in such an assay. 

An exemplary assay buffer is HBSA (10 mM Hopes, 1 50mM 
sodium chloride, pH 7.4, 0.1% bovine serum albumin). All reagents were 
from Sigma Chemical Co. (St. Louis, MO), unless otherwise indicated. 

20 Two IC50 assays at 30-minute (a 30-minute preincubation of test 
compound and enzyme) and at 0-minutes (no preincubation of test 
compound and enzyme) are conducted. For the IC50 assay at 30-minute, 
the following reagents are combined in appropriate wells of a Corning 
microtiter plate: 50 microliters of HBSA, 50 microliters of the test 

25 compound, diluted (covering a broad concentration range) in HBSA (or 
HBSA alone for uninhibited velocity measurement), and 50 microliters of 
the MTSP or protease domain thereof diluted in buffer, yielding a final 
enzyme concentration of about 100-500 pM. Following a 30-minute 
incubation at ambient temperature, the assay is initiated by the addition 

30 of 50 microliters of a substrate for the particular MTSP (see, e.g,, table 
and discussion below) and reconstituted in deionized water, followed by 



5 



5' CTCGAGAAACGCATCGTGGGTGGAAACATGTCCTTG 3' 



-177- 



24745-1 607 



dilution in HBSA prior to the assay) were added to the wells, yielding a 
final volume of 200 microliters and a final substrate concentration of 300 
/jM (about 1.5-times Km). 

For an IC50 assay at 0-minute, the same reagents are combined: 
50 microliters of HBSA, 50 microliters of the test compound, diluted 
(covering the identical concentration range) in HBSA (or HBSA alone for 
uninhibited velocity measurement), and 50 microliters of the substrate, 
such as a chromogenic substrate. The assay is initiated by the addition of 
50 microliters of MTSP. The final concentrations of all components are 
identical in both IC50 assays (at 30- and 0-minute incubations). 

The initial velocity of the substrate hydrolysis is measured in both 
assays by, for example for a chromogenic substrate, as the change of 
absorbance at a particular wavelength, using a Thermo Max^? Kinetic 
Microplate Reader (Molecular Devices) over a 5 minute period, in which 
less than 5% of the added substrate was used. The concentration of 
added inhibitor, which caused a 50% decrease in the initial rate of 
hydrolysis was defined as the respective IC50 value in each of the two 
assays (30-and 0-minute). 

Another assay for identifying inhibitors 

Test compounds for inhibition of the protease activity of the 
protease domain of is assayed in Costar 96 well tissue culture plates 
(Corning NY). Approximately 2-3 nM the MTSP or protease domain 
thereof is mixed with varying concentrations of inhibitor in 29,2 mM Tris, 
pH 8.4, 29.2 mM imidazole, 217 mM NaCI (100 mL final volume), and 
allowed to incubate at room temperature for 30 minutes, 400 mM 
substrate is added, and the reaction monitored in a SpectraMAX Plus 
microplate reader (Molecular Devices, Sunnyvale CA) by following the 
change in a parameter correlated with hydrolysis, such as absorbance for 
a chromogenic substrate for 1 hour at 37° C. 
ASSAY FOR SCREENING IVITSPS 
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The protease domain of MTSP6 expressed in Pichia pastoris is 
assayed for inhibition by various connpounds in Costar 96 well tissue 
culture plates (Corning NY). Approximately 1-20 nM MTSP6 is mixed 
with varying concentrations of inhibitor in 29.2 mM Tris, pH 8.4, 29.2 
mM Imidazole, 217 mM NaCI (100/yL final volume), and allowed to 
incubate at room temperature for 30 minutes. 500 jjM substrate 
Spectrozyme t-PA (American Diagnostica,, Greenwich, CT) is added, and 
the reaction is monitored in a SpectraMAX Plus microplate reader 
(Molecular Devices, Sunnyvale CA) by measuring the change in 
absorbance at 405 nm for 30 minutes at 37°C. 
Identification of substrates 

Particular substrates for use in the assays can be identified 
empirically by testing substrates. The following list of substrates are 



exemplary of those that can be tested. 



Substrate name 


Structure 


S 2366 


pyroGlu-Pro-Arg-pNA.HCI 


spectrozyme t-PA 


CH3S02-D-HHT-Gly-Arg-pNA.AcOH 


N-p-tosyl-Gly-Pro-Arg- 
pNA 


N-p-tosyl-Gly-Pro-Arg-pNA 


Benzoyl-Val-Gly-Arg- 
pNA 


Benzoyl-Val-Gly-Arg-pNA 


Pefachrome t-PA 


CHgSOj-D-HHT-Gly-Arg-pNA 


S 2765 


N-a-Z-D-Arg-Gly-Arg-pNA.2HCI 


S 2444 


pyroGlu-Gly-Arg-pNA.HCI 


S 2288 


H-D-IIe-Pro-Arg-pNA.2HCI 


spectrozyme UK 


Cbo-L-(K)Glu(£7-t-BuO)-Gly-Arg-pNA.2AcOH 


S 2302 


H-D-Pro-Phe-Arg-pNA.2HCI 


S 2266 


H-D-Val-Leu-Arg-pNA.2HCl 


S 2222 


Bz-lle-GIu{g-OR)-Gly-Arg-pNA.HCI 
R = H{50%) and R = CH3{50%} 


Chromozyme PK 


Benzoyl-Pro-Phe-Arg-pNA 


S 2238 


H-D-Phe-Pip-Arg-pNA.2HCI 


S 2251 


H-D-Val-Leu-Lys-pNA.2HCI 


Spectrozyme PI 


H-D-Nle-HHT-Lys-pNA.2AcOH 




Pyr-Arg-Thr-Lys-Arg-AMC 




H-Arg-Gln-Arg-Arg-AMC 
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Boc-Gln-Gly-Arg-AMC 




Z-Arg-Arg-AMC 


Spectrozyme THE 


H-D-HHT-Ala-Arg-pNA.2AcOH 


Spectrozyme fXlla 


H-D-CHT-Gly-Arg-pNA.2AcOH 




CVS 2081-6 (MeSOj-dPhe-Pro-Arg-pNA) 




Pefachrome fVlla (CHgSOa-D-CHA-But-Arg-pNA) 



pNA = para-nitranilide (chromogenic) 
AMC = amino methyl coumarin (fluorescent) 



If none of the above substrates are cleaved, a coupled assay, 
10 described above, can be used. Briefly, test the ability of the protease to 
activate and enzyme, such as plasminogen and trypsinogen. To perform 
these assays, the single chain protease is incubated with a zymogen, 
such as plasminogen or trypsinogen, in the presence of the a known 
substrate, such, lys-plasminogen, for the zymogen. If the single chain 
15 activates the zymogen, the activated enzyme, such as plasmin and 
trypsin, will degrade the substrate therefor. 

EXAMPLE 6 
Isolation and Cloning of Matriptase 

A, Cell type and growth of cells 

20 Human prostate adenocarcinoma cell line, PC-3, was purchased 

from ATCC (catalog number CRL-1435; Manassas, VA). The cells were 
cultured at 37°C, 5% CO2 in Ham's F-12K growth medium (catalog 
number 9077; Irvine) supplemented with 2 mM L-glutamine and 10% 
fetal bovine serum. All subsequent cell manipulations were carried out 

25 according to the manufacturer's instructions, PC-3 cells were allowed to 
grow to about 90% confluence, and were then washed briefly with 1x 
phosphate buffered saline. 

B. Isolation of total RNA, and purification and enrichment of 

polyA^ RNA 

30 PC-3 cells were lysed in Trizol reagent (catalog number 15596; Life 

Technologies, Rockville, MD) and total RNA was isolated according to the 
manufacturer's protocol. The concentration of total RNA was estimated 
from absorbance reading at 260 nm. PolyA"" RNA was purified and 
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enriched using oligo-dT beads (catalog number 70061; Oligotex, Qiagen, 
Valencia, CA). 

C. Reverse-transcription and polymerase chain reaction (PGR) 

PC-3-derived polyA"" RNA was converted to single-stranded cDNA 
5 (sscDNA) by reverse transcription using ProSTAR first-strand RT-PCR kit 
(catalog number 200420; Stratagene, La Jolla, CA) and Superscript 11 
RNase H' reverse transcriptase (catalog number 1 8064-022; Life 
Technologies). After reverse transcription, an aliquot of PC-3 sscDNA (4 
/jL) was subjected to PGR using 2 mM each of the sense and anti-sense 
10 degenerate oligonucleotide primers and Tag polymerase (catalog number 
201203; Qiagen). Total reaction volume was 100 jjL. The sequence of 
the sense primer was 5'TGGRT(l)VT(l)WS(l)GC(l)RC(l)CAYTG-3' (SEQ ID 
No. 13) and that of the anti-sense was 5'- 

(l)GG(l)CC(l)CC(l)SWRTC(l)CCYT(l)RCA(l)GHRTC-3' (SEQ ID No. 14), 
15 where R = A,G; V = G,A,C; W = A,T; S = G,C; Y = C,T; H = A,T,C. The 
primer sequences correspond to two highly conserved regions in all 
chymotrypsin-like serine proteases and amplify PGR products ranging 
from approximately 400 to 500 base pairs. 

D. Clone screening and sequencing 

20 The PGR products were separated on a 2% agarose gel and purified 

using a gel extraction kit (catalog number 28706; QlAquick gel extraction 
kit; Qiagen). The purified DNA fragments were ligated into pCR2.1-T0P0 
(catalog number K4500-01; Invitrogen, Carlsbad, CA). After 
transformation into £. co/i cells, plasmid DNA was isolated and analyzed 

25 by digestion with EcoRl restriction enzyme. Clones that had inserted 

nucleic acid were further characterized by sequencing using a fluorescent 
dye-based DNA sequencing method (catalog number 4303149; BigDye 
terminator cycle sequencing kit with AmpliTag DNA polymerase; Perkin 
Elmer, Lincoln, CA). A total of 31 clones were sequenced and analyzed. 

30 All sequences were analyzed by a multiple nucleotide sequence alignment 
algorithm (blastn) (www.ncbi.nlm.nih.gov/blast) to identify identical or 
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closely related DNA deposited in GenBank (NCBI, Bethesda, MD). Those 
that did not show significant honnologY were further analyzed using 
blastx, which compares the six-frame conceptual translation products of a 
nucleotide sequence (both strands) against a protein sequence database 
(SwissProt)- Eight clones yielded identical cDNA fragments that encode 
MTSP1. MTSP1 was subsequently found to be identical to matriptase 
(GenBank accession number AP1 18224). 

E. Rapid amplification of cDNA ends (RACE) and gene-specific 
amplification of MTSP1 

To obtain DNA encoding the complete protease domain of MTSP1, 

RACE and gene-specific amplification reactions were performed. A 

human prostate Marathon-Ready cDNA (catalog # 7418-1; Clontech) was 

used to isolate part of the cDNA encoding MTSPI . Marathon-Ready 

cDNA is prepared to contain a known hybridization sequence at the 5' 

and 3' ends of the sscDNA. The 3' region of MTSPI cDNA was obtained 

by a 3'-RACE reaction using a gene specific primer, 5'- 

CACCCCTTCTTCAATGACTTCACCTTCG-3' (SEQ ID No. 55), The 5' end 

of the MTSPI protease domain was obtained by gene-specific 

amplification reaction using two MTSPI -specific primers, 5'- 

TACCTCTCCTACGACTCC-3' (SEQ ID No. 56) for the sense primer and 

5'-GAGGTTCTCGCAGGTGGTCTGGTTG-3^ (SEQ ID No. 57) for the 

antisense primer. The sequences for these two primers were obtained 

from the human SNC19 mRNA sequence. The 3'-RACE reaction and 

gene-specific PGR produced DNA fragments that were > 1 kbp in size. 

These fragments were subcloned into pCR2.1-T0P0 (Invitrogen, San 

Diego, OA), After transformation into E. co// cells, plasmid DNA was 

isolated and analyzed by digestion with EcoH\ restriction enzyme. Clones 

that had inserts were characterized by Southern blot analysis (using the 

internal cDNA fragment as probe) and by DNA sequence analysis. 

F. PGR amplification of cDNA encoding the protease domain of 
MTSPI 



-182- 





24745-1607 



To obtain a cDNA fragment encoding the entire protease domain of 
MTSPl, an end-to-end PGR amplification using gene-specific primers was 
used. The two primers used were: 5'- 

CTCGAGAAAAGAGTTGTTGGGGGCACGGATGCGGATGAG-3' (SEQ ID 

5 No. 58) for the 5' end and 5'- 

GCGGCCGCACTATACCCCAGTGTTCTCTTTGATCCA-3' (SEQ ID No, 36 

for the 3' end. The 5' primer contained the sequence that encodes the 

start of the MTSPl protease domain (VVGGTDADE) (SEQ. ID. NO. 10). 

The 3' primer contained the stop codon of MTSPl. A -800-bp fragment 

10 was amplified, purified and subcloned into the Pichia pastoris expression 

vector, pPIC9K, resulting in pPlC9K-l\/ITSP1 . 

G. Gene expression profile of MTSPl in normal tissues, cancer 
cells and cancer tissues 

To obtain information regarding the tissue distribution and gene 

15 expression level of MTSPl, the DNA insert from pPIC9K-MTSP1 was used 
to probe a blot containing RNA from 76 different human tissues (catalog 
number 7775-1; human multiple tissue expression (MTE) array; 
CLONTECH, Palo Alto, CA). Significant expression was observed in the 
colon (ascending, transverse and descending), rectum, trachea, 

20 esophagus and duodenum. Moderate expression levels were observed in 
the jejunum, ileum, ilocecum, stomach, prostate, pituitary gland, 
appendix, kidney, lung, placenta, pancreas, thyroid gland, salivary gland, 
mammary gland, fetal kidney, and fetal lung. Lower expression levels 
were seen in the spleen, thymus, peripheral blood leukocyte, lymph node, 

25 bone marrow, bladder, uterus, liver, adrenal gland, fetal heart, fetal liver, 
fetal spleen, and fetal thymus. A significant amount of the MTSPl 
transcript was also detected in colorectal adenocarcinoma cell line 
(SW480), Burkitt's lymphoma cell line (Daudi), and leukemia cell line (HL- 
60). RT-PCR of the MTSPl transcript in several human primary tumors 

30 xenografted in athymic nude mice was performed using gene-specific 
primers. A high level of MTSPl transcript was detected in colon 
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adenocarcinoma (CX-I) and pancreatic adenocarcinoma {GI-103). 
Moderate levels were observed in another colon adenocarcinoma 
(GI-112), ovarian carcinoma (Gl-102), lung carcinoma (LX-1), and breast 
carcinoma (GI-101). Another lung carcinoma (GI-117) expressed a low 
5 level of the MTSPl transcript. A similar RT-PCR was performed to detect 
the presence of the MTSPl transcript in PC-3 and LNCaP cell lines. Both 
cell lines expressed significant amounts of MTSPl transcript. 
H. Sequence analysis 

All derived DNA and protein sequences were analyzed using 
10 MacVector (version 6.5; Oxford Molecular Ltd., Madison, Wl). The cDNA 
encoding the protease domain of MTSPl is composed of 726 base pairs 
which translate into a 241 -amino acid protein sequence (rMAP) (see SEQ 
ID No. 1, 2, 49 and 50). 

EXAMPLE 7 

15 Production of Recombinant Serine Protease Domain of Matriptase or 
MTSPl (rMAP) 

A. Fermentation 

The production of multi-milligram amounts of rMAP was carried out 
by fermentation in a BioFlo 3000 fermentor (New Brunswick Scientific, 

20 NJ) equipped with a 3.3 L capacity bioreactor using a 

SMD1 168/pPIC9K:MTSP1 Sac SCI clone. ZA001 complex media (10 g/L 
yeast extract, 20 g/L peptone, 40 g/L glycerol, 5 g/L ammonium sulfate, 
0.2 g/L calcium sulfate dihydrate, 2 g/L magnesium sulfate heptahydrate, 
2 g/L potassium sulfate, 25 g/L sodium hexametaposphate, 4.35 ml/L 

25 PTM1) was inoculated with 100 ml of an overnight culture of the 
P, pastoris transformant. The culture was supplemented with 50% 
glycerol by fed-batch phase and induced for 18-24 hours with methanol 
controlled at 0.025%. 
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B. Purification of Recombinant Serine Protease Domain of 
Matriptase or MTSP1 (rMAP) 

The rMAP was secreted into the culture medium, so the first step 

of the purification involved the removal of cells and cell debris by 

5 centrifugation at 5000 g for 30 minutes. The resulting supernatant was 

decanted, adjusted to pH 8.0 with 10 N NaOH, and filtered through a 

SartoBran 300 0.45 + 0.2 /jM capsule. This supernatant was 

concentrated to 1 L by ultrafiltration using a 10 kDa ultrafiltration 

cartridge (NC SRT UF system with AG/Technologies UFP-10-C-5A filter), 

10 and the buffer was exchanged by crossflow filtration into 50 mM tris-HCI, 
50 mM NaCL 0.05% tween-80, pH 8.0 (buffer A). The filtration unit was 
rinsed once with 1 L buffer A which was combined with the concentrate. 

The concentrated rMAP-containing solution was passed over a 150 
ml benzamidine column that had been equilibrated with buffer A, at a flow 

15 rate of 8 ml/min. The column was washed with 3 column volumes of 50 
mM tris-HCI, 1.0 M NaCI, 0.05% tween-80, pH 8.0 (buffer B) and eluted 
with 3 column volumes of 50 mM tris-HCI, 1 .0 M L-arginine, 0.05% 
tween-80, pH 8.0 (buffer C). Fractions containing rMAP were identified 
by activity assay and pooled. This pooled material was concentrated to 

20 10 ml using a JumboSep concentrator (Pall Gelman) and a 10 kDa cutoff 
membrane. Once concentrated to 10 ml, the buffer was exchanged into 
50 mM Na2HP04, 125 mM NaCI, pH 5.5 (buffer D) and the volume 
adjusted to 5-10 ml. The retentate was removed and the concentrator 
washed with buffer D which was added to the concentrate. The total 

25 sample volume was adjusted 15 ml. 

The partially purified rMAP was passed through a 5 ml Q-sepharose 
Fast Flow HiTrap column (Amersham-Pharmacia Biotech) pre-equilibrated 
with 15 ml of buffer D. The flow through was collected. The HiTrap 
column was washed with an additional 10 ml of buffer D, Both flow 

30 throughs were pooled, and the protein concentration was determined by 
measurement of OD280 (using an extinction coefficient of 2.012 
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mg/OD28o)" Purified rMAP was then deglycosylated by the addition 0.1 /yl 
of Endoglycosidase H (ProZyme, 5 U/ml) per mg of protein and incubating 
overnight at 4°C with gentle swirling. 

The conductivity of the deglycosylated pool was adjusted to 2.0- 
5 3.0 mS/cm with Nanopure H2O and the pH adjusted to 6.5 (-200-300 mL 
final volume). The rMAP was then further purified by anion exchange 
chromatography by loading directly onto a Pharmacia Akta Explorer 
system using a 7 mL Source 15Q anion exchange column (Amersham- 
Pharmacia Biotech). The protein was eluted in a buffer containing 50 mM 

10 HEPES, pH 6.5 with a 0-0.33 M NaCI gradient over 10 column volumes at 
a flow rate of 6 ml/min. Fractions containing protein were pooled, and 
benzamidine was added to a final concentration of 10 mM. Protein purity 
was examined by SDS-PAGE and protein concentration determined by 
measurement of OD280 and use of a theoretical extinction coefficient of 

15 2.012 mg/OD28o- 



activity was assessed by determining the inhibitor-induced inhibition of 
amidolytic activity by the MAP, as measured by IC50 values. The assay 
buffer was HBSA (10 mM Hopes, 150mM sodium chloride, pH 7.4, 0.1% 
bovine serum albumin). All reagents were from Sigma Chemical Co. (St. 

25 Louis, MO), unless otherwise indicated. 

Two IC50 assays (a) one at either 30-minutes or 60-minutes (a 30- 
minute or a 60-minute preincubation of test compound and enzyme) and 
(b) one at 0-minutes (no preincubation of test compound and enzyme) 
were conducted. For the IC50 assay at either 30-minutes or 60-minutes, 

30 the following reagents were combined in appropriate wells of a Corning 
microtiter plate: 50 microliters of HBSA, 50 microliters of the test 



EXAMPLE 8 



Assays 



20 



Amidolytic Assay for Determining Inhibition of Serine 
Protease Activity of IVIatriptase or IVITSPI 

The ability of test compounds to act as inhibitors of rMAP catalytic 
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compound, diluted (covering a broad concentration range) in HBSA (or 
HBSA alone for uninhibited velocity measurement), and 50 microliters of 
the rMAP (Corvas International) diluted in buffer, yielding a final enzyme 
concentration of 250 pM as determined by active site filtration. Following 
5 either a 30-minute or a 60-minute incubation at ambient temperature, the 
assay was initiated by the addition of 50 microliters of the substrate S- 
2765 (N-a-Benzyloxycarbonyl-D-arginyl-L-glycyl-L-arginine-p-nitroaniline 
dihydrochloride; DiaPharma Group, Inc.; Franklin, OH) to each well, 
yielding a final assay volume of 200 microliters and a final substrate 

10 concentration of 100 jjM (about 4-times K^). Before addition to the assay 
mixture, S-2765 was reconstituted in deionized water and diluted in 
HBSA. For the IC50 assay at 0 minutes; the same reagents were 
combined: 50 microliters of HBSA, 50 microliters of the test compound, 
diluted (covering the identical concentration range) in HBSA (or HBSA 

15 alone for uninhibited velocity measurement), and 50 microliters of the 
substrate S-2765. The assay was initiated by the addition of 50 
microliters of rMAP. The final concentrations of all components were 
identical in both IC50 assays (at 30- or 60- and 0-minute). 

The initial velocity of chromogenic substrate hydrolysis was 

20 measured in both assays by the change of absorbance at 405 nM using a 
Thermo Max® Kinetic Microplate Reader (Molecular Devices) over a 5 
minute period, in which less than 5% of the added substrate was used. 
The concentration of added inhibitor, which caused a 50% decrease in 
the initial rate of hydrolysis was defined as the respective IC50 value in 

25 each of the two assays (30- or 60-minutes and 0-minute). 

In vitro enzyme assays for specificity determination 
The ability of compounds to act as a selective inhibitor of 
matriptase activity was assessed by determining the concentration of test 
compound that inhibits the activity of matriptase by 50%, {IC50) as 

30 described in the above Example, and comparing IC50 value for matriptase 
to that determined for all or some of the following serine proteases: 
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thrombin, recombinant tissue plasminogen activator (rt-PA), plasmin, 
activated protein chymotrypsin, factor Xa and trypsin. 

The buffer used for all assays was HBSA (10 mM HEPES, pH 7.5, 
150 mM sodium chloride, 0.1% bovine serum albumin). The assay for 
5 ICgo determinations was conducted by combining in appropriate wells of a 
Corning microtiter plate, 50 microliters of HBSA, 50 microliters of the test 
compound at a specified concentration (covering a broad concentration 
range) diluted in HBSA (or HBSA alone for Vq (uninhibited velocity) 
measurement), and 50 microliters of the enzyme diluted in HBSA. 

10 Following a 30 minute incubation at ambient temperature, 50 microliters 
of the substrate at the concentrations specified below were added to the 
wells, yielding a final total volume of 200 microliters. The initial velocity 
of chromogenic substrate hydrolysis was measured by the change in 
absorbance at 405 nm using a Thermo Max® Kinetic Microplate Reader 

15 over a 5 minute period in which less than 5% of the added substrate was 
used. The concentration of added inhibitor which caused a 50% 
decrease in the initial rate of hydrolysis was defined as the IC50 value. 
Thrombin (flla) Assay 

Enzyme activity was determined using the chromogenic substrate, 

20 Pefachrome t-PA {CH3S02-D-hexahydrotyrosine-glycyl-L-Arginine-p- 
nitroaniline, obtained from Pentapharm Ltd.). The substrate was 
reconstituted in deionized water prior to use. Purified human a-thrombin 
was obtained from Enzyme Research Laboratories, Inc. The buffer used 
for all assays was HBSA (10 mM HEPES, pH 7.5, 150 mM sodium 

25 chloride, 0,1 % bovine serum albumin). 

IC50 determinations were conducted where HBSA (50 //L), 
a-thrombin (50 (the final enzyme concentration is 0.5 nM) and inhibitor 
(50 jj\) (covering a broad concentration range), were combined in 
appropriate wells and incubated for 30 minutes at room temperature prior 

30 to the addition of substrate Pefachrome-t-PA (50 //I) (the final substrate 
concentration is 250 //M, about 5 times Km). The initial velocity of 
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Pefachrome t-PA hydrolysis was measured by the change in absorbance 
at 405 nm using a Thermo Max® Kinetic Microplate Reader over a 5 
minute period in which less than 5% of the added substrate was used. 
The concentration of added inhibitor which caused a 50% decrease in the 
5 initial rate of hydrolysis was defined as the IC50 value. 
Factor Xa 

Factor Xa catalytic activity was determined using the chromogenic 
substrate S-2765 (N-benzyloxycarbonyl-D-arginine-L-glycine-L-arginine-p- 
nitroaniline), obtained from DiaPharma Group (Franklin, OH). All 

10 substrates were reconstituted in deionized water prior to use. The final 
concentration of S-2765 was 250 //M (about 5-times Km). Purified 
human Factor X was obtained from Enzyme Research Laboratories, Inc. 
(South Bend, IN) and Factor Xa (FXa) was activated and prepared from it 
as described [Bock, P.E., Craig, P.A., Olson, S.T., and Singh, P. Arch, 

15 Biochem. Biophys, 273:375-388 (1989)]. The enzyme was diluted into 
HBSA prior to assay in which the final concentration was 0.25 nM. 
Recombinant tissue plasminogen activator (rt-PA) Assay 

rt-PA catalytic activity was determined using the substrate, 
Pefachrome t-PA (CH3S02-D-hexahydrotyrosine-glycyI-L-arginine-p-nitro- 

20 aniline, obtained from Pentapharm Ltd.). The substrate was made up in 
deionized water followed by dilution in HBSA prior to the assay in which 
the final concentration was 500 micromolar (about 3-times Km). Human 
rt-PA (Activase®) was obtained from Genentech Inc. The enzyme was 
reconstituted in deionized water and diluted into HBSA prior to the assay 

25 in which the final concentration was 1.0 nM. 

Plasmin Assay 

Plasmin catalytic activity was determined using the chromogenic 
substrate, S-2366 [L-pyroglutamyl-L-prolyl-L-arginine-p-nitroaniline 
hydrochloride], which was obtained from DiaPharma group. The 
30 substrate was made up in deionized water followed by dilution in HBSA 
prior to the assay in which the final concentration was 300 micromolar 
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(about 2.5-tinnes Km), Purified human plasmin was obtained from Enzyme 
Research Laboratories, Inc. The enzyme was diluted into HBSA prior to 
assay in which the final concentration was 1 .0 nM. 
Activated Protein C (aPC) Assay 

aPC catalytic activity was determined using the chromogenic 
substrate, Pefachrome PC (deita-carbobenzloxy-D-iysine-L-prolyl-L- 
arginine-p-nitroaniline dihydrochloride), obtained from Pentapharm Ltd.). 
The substrate was made up in deionized water followed by dilution in 
HBSA prior to the assay in which the final concentration was 400 
micromolar (about 3-times Km). Purified human aPC was obtained from 
Hematologic Technologies, Inc. The enzyme was diluted into HBSA prior 
to assay in which the final concentration was 1 .0 nM. 
Chymotrypsin Assay 

Chymotrypsin catalytic activity was determined using the 
chromogenic substrate, S-2586 (methoxy-succinyl-L-arginine-L-prolyl-L- 
tyrosyl-p-nitroanilide), which was obtained from DiaPharma Group. The 
substrate was made up in deionized water followed by dilution in HBSA 
prior to the assay in which the final concentration was 100 micromolar 
(about 9-times Km). Purified {3X-crystallized; CDI) bovine pancreatic 
alpha-chymotrypsin was obtained from Worthington Biochemical Corp. 
The enzyme was reconstituted in deionized water and diluted into HBSA 
prior to assay in which the final concentration was 0.5 nM. 
Trypsin Assay 

Trypsin catalytic activity was determined using the chromogenic 
substrate, S-2222 (benzoyl-L-isoleucine-L-glutamic acid-[gamma-methyl 
ester]-L-arginine-p-nitroanilide), which was obtained from DiaPharma 
Group. The substrate was made up in deionized water followed by 
dilution in HBSA prior to the assay in which the final concentration was 
250 micromolar (about 4-times Km). Purified (3X-crystallized; TRL3) 
bovine pancreatic trypsin was obtained from Worthington Biochemical 
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Corp. The enzyme was reconstituted in deionized water and diluted into 
HBSA prior to assay in which the final concentration was 0,5 nM. 

Since modifications will be apparent to those of skill in this art, it is 
5 intended that this invention be limited only by the scope of the appended 
claims. 
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